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A FACTORIAL ANALYSIS OF FIFTEEN MMPI SCALES 


FRED T. TYLER’ 


UNIVERSITY OF CALIFORNIA 


T is only quite recently that any reports of 
factor analyses of the Minnesota Multi- 
phasic Personality Inventory have been 

iblished. Marjory Stout [12] reported a 
ictor analysis of the original nine scales of 
he MMPI for college men and for college 
men. The results showed that four factors 
ounted for the major part of the interrela- 
onships for each sex. Actually, the loadings 
mn all but the first factor were rather small, on- 
two in the second and one in each of the 
hird and fourth factors exceeding .30. Cook 
! Wherry [1], using 120 naval enlisted sub- 
ne personnel as subjects, analyzed a bat- 
of 20 variables, including 11 from the 
\MIMPI. Six factors were isolated, loadings of 
20 or larger being regarded as significant. The 
rst factor included all the MMPI scales with 
the exception of the LZ and F scales. This fac- 

was labeled the “tendency to personality 
maladjustment.” The third factor had positive 


lings on the Ma scale and negative loadings 


L, Pd, Hs, D. and Hy. The latter three 
requently referred to as the ‘‘neurotic tri- 
I'he whole factor was titled “tendency to 

veractivity.” The factor was interpreted as 

olving lack of self-consciousness and self- 
riticism, with a tendency toward impatience 
nd impulsiveness. Factor IV included L, Hy, 
Mf, and Pa, and was labeled “tendency to 
paranoia,’ because the largest loading, .48, 
vas with Pa. The sixth factor was entitled 
tendency to femininity of interest pattern.” 
lhe loadings were positive for Mf and Pt, 
ind negative for Hy. The suggestion here was 
hat the more effeminate men tend toward 
mpulsive behavior. 

Cottle [2] reported a factor analysis of the 
MMPI, the Strong Vocational Interest Blank 


‘The writer is indebted to Harrison G. Gough for 
‘coring stencils for several here 
tor assistance in interpreting the factors. 


scales used and 


for Men, the Kuder Preference Record, and 
the Bell Adjustment Inventory. The subjects 
were 400 male veterans who had been referred 
to the Psychological Services Center of Syra- 
cuse University. The factors were extracted 
by Thurstone’s complete centroid method, the 
final primary factors being based upon an ob- 
lique rotation of the axes. The MMPI con- 
tribution to each factor is summarized: 


1. Factor B included F, Hs, Pd, Pt, Sc, and Ma, 
with L having a negative loading. The largest load- 
ings were contributed by the scales included in the 
“psychotic phase” of the MMPI. The factor was 
labeled “more serious maladjusted tendencies.” 

2. Factor D consisted of Hs, D, and Hy, the 
scales included in the neurotic triad. The factor 
was considered to indicate “neurotic tendencies.” 

3. Factor E included the Mf scale 
Kuder MF, and was considered to be a 
plane.” 


(and the 
“residual 


In general, Cottle found practically no over- 
lapping between personality and interest in- 
ventories. 

Wheeler, Little, and Lehner [14] investi- 
gated the internal structure of the MMPI us- 
ing two groups of subjects, male college stu- 
dents and male neuropsychiatric patients in a 
veterans hospital. Their correlation matrix in- 
cluded twelve variables, the nine clinical scales 
and the L, F, and K scales. —Ttwo major and 
two minor factors emerged from Thurstone’s 
centroid multiple factor analysis; the interpre- 
tation is based upon the factors obtained after 
rotation. Reference will here be confined to 
the results obtained from college men. Factor 
I had its greatest loadings on the Sc and Pt 
scales, components of what is usually known 
as the “psychotic triad.”” The writers believe 
this factor indicates concern with one’s self 
and reflects “withdrawal of a schizoid type in- 
cluding excessive concern with compulsive 
needs.” The second factor had its maximum 
loading on Hy and high loadings on Hs and 
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D, the three components of the neurotic triad. 
The second factor is believed to reveal a pic- 
ture of neurotic behavior. The third and 
fourth factors are difficult to interpret inas- 
much as each includes only two scales. One 
of these includes the Mf and the Pa scales, re- 
flecting a possible relationship between mascu- 
linity-femininity and paranoia in normal male 
subjects. 


PROBLEM 


The purpose of the present paper is to re- 
port the results of a factor analysis of the 
M\IPI scores of 107 female graduate stu- 
dents preparing to become elementary teachers 
at the University of California, Berkeley. All 
subjects had passed a physical examination, 
and none had been referred to a psychiatrist 
for further examination. The subjects an- 
swered all questions in the booklet form of the 
MMPI. This made it possible to obtain scores 
for 15 scales exclusive of the F, L, and Validi- 
ty scales. In addition to the original 9 scales, 
scores were obtained for: 


1. Si: a scale for social introversion [3, 5]. 
2. St: a scale for social status [8]. 
3. Pr: a measure of prejudice [7]. 

TABLE 1 
MEANS AND SIGMAS OF SUBJECTS ON EACH SCALE: 
Mean 7-Scores FOR THREE STUDIES 





-seore 


Subtest 


x 
Ps: 


M SD M M 


Deser 


3.56 2.64 44 46 


Hs Hypochondriasis 

D Depression 19.46 4.29 50 5 
Hy Hysteria 20.99 3.46 54 55 
Pd_ Psychopathic Deviate 13.02 3.58 49 53 
Mf Masculinity-Femininity 38.28 3.66 47 49 
Pa _ Paranoia 9.52 2.64 54 3 
Pt  Psychasthenia 988 543 46 49 
Sc Schizophrenia 9.27 6.09 48 50 
Ma Hypomania 14.94 3.36 52 57 
S: Social Introversion 25.04 7.55 50 

St Status 24.02 3.11 64 

Pr Prejudice 6.39 3.99 37 

Ac Academic Achievement 17.37 2.63 51 

Re_ Responsibility 24.41 2.32 63 

Deo Dominance 19.64 244 63 


*T-scores for the original nine scales were obtained 
from the Manual of Directions; T-scores for the remain- 
ing scales were obtained from data supplied by Gough. 
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Re a 


ynsibility 


a measure of academic achievement driv: Th 


scale prepared to measure socia 





6. Do: a measure of social dominance ¢ 
Table 1 shows the mean scores (raw and St 
T) for the group of women used in the present 
analysis. It may be noted from Table 1 that 
there is a distinct tendency, with a few ex- § fer 
ceptions, for the group means to center closely rt 
on a T-score of 50. The group is somewhat J ke 
below average on Hypochrondriasis and o; p 
Prejudice. On the other hand, the mean scores sampl 
are larger than 50 on Status, Responsibility Rr 
and Dominance. Further data may reveal 
whether these divergencies are characteristic of 
college students in general or more particular- fp 
ly of women students planning to become Ph.D 
elementary school teachers. Certainly this 


group does not reveal any marked tendency 
toward psychoneuroses or neuroses as defined T: 
by the elevated scores on the triads of Pr, § anon 
and Pa, and Hs, D, and Hy, respectively. 


TABLE 2 
DESCRIPTIONS OF MMPI SCALEs 





Hs Concern over bodily functions, concern at 
health, tendency toward physical complaint 
D Depression, dejection, discouragement, desp 
dency 
Hy Immature, unrealistic, amenable to group 
ideas, kindly, courteous, naive, needs social ac 
ceptance 
undependable, impulsive, eg 
centric, defiant, asocial, individualistic 


Pd Irresponsible, 


Pa Aggressive, critical, irritable, moody, sensitive 
sensitive to criticism 

tense, hesitant, insecure, self 

conscious, feelings of inadequacy 


Sc  Bashful, 


Pt Apprehensive, 


withdrawn, oversensitive, secretive 













, Hs 
cautious 
Ma Confident, hypersensitive, not persistent, ag nig 
gressive, charming, expansive Py 
Mf Masculinity or femininity of interests Mi 
Pr Cynicism, poor morale, suspiciousness, feelings P 
of ill will, lack of faith in others Py 
Si Tendency to avoid social contacts, little de : 
pendency upon people Ma 
Ac Tenseness, docility, insecurity (producil = 
greater academic output) ¢, 
St  Self-confidence, poise, resourcefulness Py 
Re Dependability, integrity, seriousness, steadiness 4, 
tenacity P, 
Do Optimism, persuasiveness, self-discipline, res ~ 
luteness 
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The present group of women had lower 
mean scores on all scales than did the subjects 
sed by Stout except on Mf and Pa. Especial- 
nteresting are the differences on the Hs, 
Sc, and Ma scales, suggesting that in 

t's group there was more concern over 
sical well-being, more tenseness, insecurity 
ution, and more social aggressiveness. If 
MMPI scores continue to be 
rted from different universities, research 


ferences in 


rkers may need to consider more carefully 
problems arising from the nature of the 
les used in an investigation. 
rief descriptions and synonyms for the 
rious scales are shown in Table 2. These 
scriptions were obtained from the Manual 
Directions for the MMPI, from Rosen’s 
Ph.D. dissertation [11], from Drake [3], 
from Evans and McConnell [5], and from 
igh [ 6, 7, 8]. 
Table 3 shows the intercorrelations, com- 
ed by machine from raw scores, for the 15 
riables used in the present study. The 
of the MMPI indicate that the L 
may have some value as a measure of 
ersonality trend, but they provide no inter- 
tion. They say specifically that the “J 
s not a personality scale.” The manual 
e K scale states: “K is not known to 
much clinical significance in itself.” The 
‘ction factor was not applied to the rele- 
ales because there is little evidence to 
te either greater reliability or validity 
he corrected scores. Statistical studies of 


+} 


nt 
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the K scale are in process, and it is hoped that 
the results may be published at an early date. 
It appears that an analysis based upon K scores 
and scores on corrected clinical scales would 
offer difficulty 


five of the clinical scales would be 


in interpretation inasmuch as 
orrelated 
with K because K is part of the clinical score. 
For these reasons L, F, and K were not used 
in the present analysis. The correlations 
ranged from —.56 (Maand Re) through zero 
up to .7/7 (Pt and Sc). 
the table contains the intercorrelations. 


The upper half of 
The 
analyzed by ‘Thur- 
stone’s complete centroid method. The analy 
sis was carried through until five factors had 
been extracted, at which time the 
12 to 11, with a mean of 
.04 when true communalities had been used 
in the analysis. The residuals are shown in the 
lower half of Table 3. 


correlation matrix was 


residuals 
varied from 


The resulting factorial patterns were rotat- 
ed sc as to produce both orthogonal and ob- 
lique,axes. Unrotated and rotated factors (ob- 
shown in ‘Tables 
4, 5, 6, 7, and 8, together with a brief inter- 
pretation of each factor. The descriptions tend 
to be eclectic, being based upon rotated and 
unrotated factors. The three sets of loadings 
are presented so that the reader who prefers 
one of these factorial patterns to another may 
make his own interpretation. 

Factor I (Table 4) includes 13 of the 15 
variables and so is probably best considered 
as a “general 


lique and orthogonal) are 


maladjustment factor.” In 


TABLE 3 


D Hy Pd 


49 24 34 
37 
03 16 


05 
07 
05 
03 
01 
04 
-07 
03 


INTERCORRELATIONS AND RESIDUALS AFTER EXTRACTION OF Five FACTORS 


St Pr Ai 


37 


18 
23 
~16 
49 
34 
18 


41 


33 


03 
08 
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TABLE 4 
“actor I: UNROTATED AND ROTATED 


Unrotated Orthogonal 


Oblique 


S00 386 386 


$92 392 


—340 
697 698 
615 421 423 
352 
719 849 847 
—$89 —538 —538 
739 702 702 
Ai 595 562 561 
Re —363 
Do —379 


—338 


—+60 


—459 





normal subjects it should probably not be in- 
terpreted as measuring either “general neu- 
roses’ or “general psychoneuroses’ even 
though loadings on Hs, D, Pt, and Sc are 
rather high. It appears, rather, to reflect a 
chronic state of marginal maladjustment in- 
cluding concern with one’s self. Subjects who 
score high on the factor would tend to have 
difficulties in their social contacts, possibly by 
reason of their dissatisfactions, worry and dis- 
gruntlement, and their complaining and pessi- 
mistic attitude. Low scores would be indica- 
tive of a more socially poised, self-confident, 
and optimistic attitude. The factor is similar 
to Wheeler’s Factor I, as far as the heavy load- 
ings are concerned. 


TABLE 5 


Factor II: UNROTATED AND ROTATED 


Unrotated Oblique Orthogonal 


Hs 399 (292) 
D 443 309 352 
Hy 488 340 387 
Pd 325 

Mf 456 

Pa 399 427 411 
Pt (283) 453 437 
Sc (289) 304 
Ma (-298) -306 -318 
Pr -312 (-299) -316 
Re 414 482 489 





Factor II (Table 5) might be termed “a 
conflict factor’ because it seems to suggest a 
conflict between opposing tendencies, together 


with a lack of insight. Dissatisfaction appears 
to be present in the various scales contributing 
to the factor, but probably to a lesser degree 
than in the preceding factor; with this dis 
satisfaction is a tendency toward excessive self 
control. High scores would suggest the pres. 
ence of a series of contrasts or conflicting at 
titudes: (1) need for independence, but with 
underlying dependency ; (2) a facade of altry 
ism and selflessness, along with a basic narcis 
(3) an emphasis on objectivity and 
frankness along with a heightened sensitive. 
ness and rigidity; (4) self-pity along with 
fantasies of fortitude and resoluteness. The 
factor points to a type of neurotic (Hs, D, and 
Hy) personality with repressive, hysteroid 
characteristics. The general picture presented 
by the unrotated factor is similar to Wheel- 
er’s second factor. 


$ism ; 


TABLE 6 
Factor III: UNRoTrAtep AND ROTATED 


Unrotated Oblique Orthogonal 


317 331 


St 


Ac 
Re 
Do 


The general picture of Factor III (Table 
6) suggests a tendency toward social aggres- 
siveness and consequent friction. Women scor- 
ing high on this factor would probably be 
somewhat inconsiderate of others, abrupt, 1r- 
ritable, and impatient in social contacts, and 
determined, impulsive, and resentful in their 
behavior. They are not likely to have any 
very satisfactory insight into their own role 
in producing their difficulties. They may re 
veal rather egocentric and hostile attitudes, but 
there is little likelihood of any real self-dis 
satisfaction. Such behavior as that outlined 
may make for unhappiness in social situations 
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but a person with such characteristics could 
attain satisfactory social and professional goals 
ther traits and abilities were favorable. 


TABLE 7 


scTor IV: UNROTATED AND ROTATED 





Orthogonal 


544 


Unrocated Oblique 


347 
-359 


418 


545 


483 
328 


482 
328 


In general, Factor 1V (Table 7) indicates 
good social adjustment accompanied by strong 
jrives and self-confidence. A woman scoring 
high on this factor is likely to be objective and 
open-minded in her behavior toward others. 
Women scoring high on this factor should also 
be characterized by forcefulness and efficiency 

general. 


TABLE 8 


Facror V: UNROTATED AND ROTATED 


Unrotated Oblique Orthogonal 


334 33 
~349 


-525 


lhe nature of Factor V (‘Table 8) is not 
well revealed because of the limited num- 
of variables contributing to the factor. 
the other hand, its interpretation is rela- 
clear-cut; the association of convention- 
femininity” (weakness, helplessness, and 
ndency) with a distinct tendency toward 
‘al complaints. Wheeler, Little, and 
hner found Mf to be related to Pa in college 
in the case of men, a tendency toward 
femininity is related 
nsitive behavior. 


to aggressive, critical, 


SUMMARY 


|. The MMPI was administered to 107 
women students training at the University of 
California to become elementary school teach- 
ers. Fifteen scores were obtained for each stu- 
lent, the nine original clinical scales of the 
MMPI and six which have been developed 
more recently. 


AND CONCLUSIONS 


2. The intercorrelation matrix was factor- 
nalyzed by the complete centroid method; 
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SCALES 455 
the unrotated factors were compared with the 
factors obtained from both eblique and or- 


thogonal axes. 


3. Five factors were obtained, and descrip- 


tions were attempted. In general, the results 


are in agreement with interpretations of the 
MMPI by clinical psychologists. 


4. Inasmuch as this is the only factor anal- 


ysis available tor the fifteen scales used in this 
study, it is not possible to compare very specifi- 
cally the results with those of other investi- 
gators. Stout extracted four factors from the 
original nine scales of the M[MPI, but in the 
main she found a heavy general factor running 
through these scales. ‘This may be comparable 
to Factor I described in this study. Wheeler, 
Little, and Lehner reported two more or less 

P= 


wel lefined vroun factors in 


the L, / 


the original nine 


and K Ss ales. 


clinical scales and 
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THE HUMM WADSWORTH AND THE 
MINNESOTA MULTIPHASIC 


A. R. GILLILAND 


NORTHWESTERN 


HE present study consists of a compari- 
son of two well-known personality tests, 
the Humm-Wadsworth Temperament 
Scale and the Minnesota Mulitphasic Person- 
lity Inventory. These tests were given to a 
ip of college students for an interpretation 


of their personality characteristics. Because of 


» nature of the use of these tests in this 
study, the results bear only indirectly on their 
e for industrial or clinical purposes. 
[he tests were generally given to classes of 
t fifty advanced students made up largely 
those majoring in psychology, education, 
h, or journalism. Both the individual and 
p forms of the MMPI were used on some 
ps. All the tests were scored as directed 
the latest manuals. The raw scores on each 
test were converted into their respective types 
tandard scores. 
The Humm-Wadsworth and the MMPI 
measure five components or traits which 
e the same or very similar names or descrip- 
ns. These are: (1) hysteroid—psychopath- 
leviate, (2) mania—hypomania, (3) de- 
sion, (4) autistic—schizoid, and (5) 
iranola. 
It has been pointed out, however, by one of 
e authors' of one of these tests that al- 
gh identical or similar terminology has 
n used in naming some of the traits in the 
tests, this does not necessarily mean that 
factors measured are 
similar. 


identical or even 

For example, in the Humm- 
Wadsworth scale normality or abnormality is 
largely measured by the first component, or at 
least its control effect over the other traits is 


ndicated. No such method is used in the 
MMPI. High scores and seemingly low 


res (although little mention is made of 
these) indicate abnormality on the MMPI. 


In a_ personal 


letter from Dr. Doncaster G. 


mm, dated December 30, 1949. 


Wy 
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Of the five traits, probably there is the greatest 
difference between hysteroid and psychopathic 
deviate. Psychopathic deviate is the broader 
term, whereas hysteroid refers primarily only 
Never- 
ground cover- 


— 


ed by the two terms to include them in the 


comparison of traits. Li 


to antisocial or criminal tendencies 
theless, there is enough common 
kewise, autistic is not 
as inclusive a term as schizophrenia. But the 
autistic tendency is certainly one of the out- 
standing characteristics of schizophrenia, and 
it seems reasonable to assume a high relation- 
ship between them. 

No attempt was made, as suggested in the 
manual, to use the “normal”’ scores as a basis 
for interpreting the other components on the 
Humm-Wadsworth scale. This was partly 
due to the fact that the component control 
measure did not seem especially suited for this 
purpose. There was also another reason. Seem- 
ingly, each component of the Humm-Wads- 
worth was originally validated against an in- 
dependent criterion. If so, this would justify 
the interpretation of scores on each part of the 
test as a measure of this component in the sub- 
ject’s personality. 

Table 1 gives the Pearsonian correlations be- 


TABLE 1 


CORRELATIONS BETWEEN HUMM-WADSWORTH TEST 
AND MMPI 


i s s 
= 2 £ 4 a 
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"Group was administered the MMPI Group Form 


#Group was administered the MMPI Individual Form. 





tween the five common components of the two 
tests for six different classes containing a total 
of 329 students. The first four lines of the 
table present correlations between the Humm- 
Wadsworth and the group MMPI for a total 
of 229 students. The last two groups of 100 
students give a comparison between the in 
dividual form of the MMPI and the Humm- 
Wadsworth tests. No complicated statistical 
devices were used in interpreting the meaning 
of these correlations. This omission was not 
due to a failure to consider such possibilities 
but, rather, because it appeared that the best 
method of interpretation would come from an 
inspection of the data to determine trends or a 
la k of trends 

A study of the table of correlations shows al- 
most exactly a chance distribution from zero 
as the mean. No general agreement between 
scores on the two tests can be found. This 
fact seems to hold for comparisons of the 
Humm-Wadsworth with both group and in- 
dividual forms of the MMPI. The general 
statement may need to be qualified somewhat 
by saying that it holds true for at least three 
of the common traits, but possibly there is a 
slight general agreement between depression 
scores on the two tests, as shown by low posi- 
tive correlations, and a slight consistent dis- 
agreement on paranoia, as shown by four out 
of six low, negative correlations. These results 
may be restated by saying that there is a 
slight tendency for a person making a high 
score on one to make a low score on the other 
for paranoia; and there is no agreement on the 
other three components. 

The second part of this study consists of a 
comparison of each test with ratings on the 
five personality traits. Three ratings on each 
of the five traits were obtained for each sub- 
ject. One of these was a self-rating on brief 


TABLE 2 


RELIABILITY OF RATINGS: SELF-RATINGS CORRELATED 
wittH MEAN oF Two AssocrATEs’ RATINGS 
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descriptions of each trait. The outlines of 
traits were constructed by a comparison of 
traits as described in the two manuals 
iditional ratings were obtained from «:os 
sociates of the subjects who were well t: 
in the technique of personality rating. 
ratings of friends were returned in sealed 


velopes without the results being made kn 


+r +} 


o the persons being rated. 

‘Table 2 gives the correlations of the 
ratings with the means of the ratings by 
associates for each of the two classes 
careful rating method gives unusually hig] 
liabilities, as shown in the tables. The 1 
was from .42 to .90 with a mean of .60. 7 
results show a fair consistency between 
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two sets of ratings for the different traits. 


TABLE 3 


IRRELATIONS BETWEEN HUMM-W A ApDSWORT! 


AND MEAN OF ALL RATINGS 
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Table 3 gives the correlations between 


Humm-Wadsworth scores and the mea 
the three ratings for each of the five 
ponents for a total of 174 subjects. All 
2 of the 20 correlations were positive but 
The highest correlations were for the de 


TABLE 4 
CORRELATIONS BETWEEN MMPI ANpb MEA) 
oF ALL RATINGS 
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sion and autistic components. The mean cor- 
relation for these two components was .38. 
The first part of Table 4 gives the correla- 
tions between the ratings and the group form 
MMPI for 120 subjects. Four out of 
Ss of these correlations were negative. The 
omponents giving the highest correlation 
were psychopathic deviate and depression. ‘The 
ean of these two was .30. 
The last part of Table 4 gives the correla- 
tion of ratings with the individual MMPI for 
subjects. The correlations were not con- 


sistently higher than 


[hese results raise the question as to whether 


for the group form. 


with normal subjects the individual form of 

the MMPI is more valid than the group form 

They do 

not seem to justify the additional time neces- 
ry to give the individual form of the test. 


the evaluation of personality. 


In summary it may be stated that insofar as 
arefully collected ratings with mean reliabili- 


ties of .60 can be used for validation, both the 
Humm-Wadsworth and the MMPI show fair 
validities for measuring certain pe rsonality 
components and not much validity for others. 
Each correlates considerably higher with rat 
ings of these components than with each other. 
These facts may be stated in another way by 
saying that the seemingly common components 
of the two traits each measure different aspects 
of these traits. 

These results should be interpreted, not as 
a criticism of the tests, but as suggesting the 
care and tentativeness that should be attached 
to any diagnosis based exclusively upon such 
personality test scores. The results also sug- 
gest that a new test might be constructed by 
combining certain parts of each of the present 
tests, giving a measure of higher validity for 
normal subjects than either the Humm-’Vads- 
worth or the MMPI used separately. 
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THE RORSCHACH AND DESCRIPTIVE DIAGNOSIS 


J. R. WITTENBORN? 
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AND 


JULES D. HOLZBERG 


CONNECTICUT STATE HOSPITAL 


LTHOUGH the Rorschach test has 

been used primarily to evaluate the 

structure of the individual’s personali- 
ty and his characteristic ways of coping with 
certain motives, it has also enjoyed consider- 
able use as an aid in the descriptive diagnosis 
of mental hospital patients. The Rorschach 
test has been used as a diagnostic supplement 
in the sense that the Rorschach examiner has 
commonly stated the kinds of symptoms the 
patient is currently manifesting or is likely to 
manifest in the very near future. 

The present report describes the relation- 
ships found between specific Rorschach scores 
and the kinds of symptoms that patients were 
known to manifest at the time the Rorschach 
test was given. The sample upon which the 
present analyses are based comprises 199 cases 
drawn from consecutive admissions at the Con- 
necticut State Hospital at Middletown. The 
basis for selecting these 199 cases was twofold. 
All the patients were to be under 60 years of 
age, and none of the patients was to be suffer- 
ing from a disorder which is known to be 
primarily neurological or toxic in its nature. 


The validity of the various scores of the 
Rorschach test was evaluated by means of an 
employment of two criteria: 


1. Psychiatrists’ diagnosis. From the available 
sample the five most commonly occurring diagnostic 
groups were selected: paranoid schizophrenia; 
manic-depressive psychosis, manic phase; involu- 
tional psychosis; simple schizophrenia; and manic 


1The writers wish to acknowledge the collabora- 
tion of the psychiatric staff at the Connecticut State 
Hospital at Middletown and the support of Mark 
A. May, Director, Institute of Human Relations 
Yale University. 
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depressive, depressed phase. Differences betwee: 
various diagnostic groups are sought for each 

different Rorschach scoring categories. 
a psychiatric diagnosis is not a quantified attr 


Inasm ( 


the presence of relationship between a diagnosis 
a Rorschach scoring category is sought by 

plication of the x? test. This is illustrated 
Table 1, 


tients in a manic phase produced more of the CI 


which shows that the manic-depressed pa 


Rorschach determinant score than would be 
ed from the general prevalence of the Cl 


among all the 75 patients included in the five 


predict 


nostic groups. Table 1 also shows that the n 
depressed patients in a depressed phase showed 


of the CF score than would be expected fo: 


total group of 75 patients falling in the 
diagnostic groups in question. In Table 1 
in all the x? tests, the expected freque 
scores is based on a distribution of scores 
the group of 75 patients resulting when 
diagnostic groups are combined. It sh 
noted that the first diagnostic group, parat 


ophrenia, is the itributor to t 


and, as a consequence, the distribution of 


mayor cf 


total is biased in the sense that 
will most resemble the distribution for the par 


based on the 
schizophrenic group. From some standpoi: 
is an undesirable basis for comparing the differ 
diagnostic groups with respect to Rorschach 
A lance 
is considered to be a typical one and represents t! 
practical situation 

2 The criteria for the 
second analysis are the patients’ scores for eac! 


Symptom-cluster scores. 


nine different, empirically determined syndromes 
symptom clusters [1, 3, 4]. The patient’s scor 
each of these nine clusters is based on psychiatrists 
ratings and provides a kind of a quantified multip| 


psychiatric diagnosis. The nine scores evaluate th 


patient in terms of most of the different aspects 0! 


behavior which are currently considered by ps 
chiatrists to be important in a description of ment 
hospital patients. Due to its relative objectivit 
standard nature, and quantified features, this typ 
of description of the patient’s current symtomatolog) 








; on the other hand, the present grouping 


should be more favorable for research purposes than 
the usual diagnosis of the psychiatrist. It should 
be pointed out that the symptom ratings were made 
during the first few weeks of the patient’s hospital- 

1 and at about the time the Rorschach test was 


adm! istered. 


[here may be some readers who are inclined 
to question the relevance of a validation study 
of discrete Rorschach scoring categories. In- 

smuch as isolated categories are not used in 
Rorschach practice, this procedure may at first 
seem unreasonable. In an earlier re- 

rt [2], however, the writer has shown that 
combination of scores such as the differ- 
es and ratios employed in scoring the 
Rorschach can bear a higher relationship with 

riterion than an optimal weighting of the 

ynent scores. For this reason, it seems 

priate to study the validity of the com- 

a nonent scores of the Rorschach test because if 
there is an evidence for the validity of these 
mponent scores, we may be optimistic about 

ilidity of combining them for the predic- 

of different descriptive criteria. If, how- 

ess ever, none or relatively few 





omponent scores 
1 validity for predicting the descriptive 
ts of the patient’s manifestations, the 
nt data, at least, cannot be employed to 
rt anv optimism concerning the possibili- 
‘ using combinations of scores for predict- 
he descriptive status of the patient. It 
Id} that any validity or 
f validity revealed by the present sample 


ye noted, however 
t be generalized to outpatient samples, 
it reasonable to suppose that any validi- 
lack of validity found for the descriptive 
the Rorschach test may be applied to 
uses such as evaluations of the individu- 
It should also be 


ed that there are many important fea- 


otivational pattern. 
the tures of the Rorschach test which are common- 

‘d both for descriptive and dynamic evalu- 
included 
characteristics of the subject’s 
, mode of response to the testing situa- 


which are not in the present 

1. ses, @.9.. 
sts es it. 
ind idiosyncratic or peculiar content. 


2 fh | . . 
e Rorschach scoring categories emploved 


e present analyses are: WY, DW, DdW. 
VS8.D, Dd, DdD. Ds, Dds. R, F+. F. F—. 
vee 1, FM, m, FC, CF, C, FC’, C’F, C’, Fe, 
og) PPV, FY,A,H, Hd, At, Xray, Arch, Geo, 


DI . 
eyes, P, O. color shock, shading shock. 


sy » ¢! 
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RESULTS 


The results of both analyses 1 and 2 are 
rather discouraging from the standpoint of 
using the Rorschach scoring categories or any 
combination thereof in the prediction of the 


symptoms patients are likely to manifest. 


TABLE 1 


DISTRIBUTION OF CF RESPONSES FOR PATIENTS 
IN EACH OF THE DIAGNOSTIC GROUPS 


CF Responses 


Total 
Diagnostic Gr One 
None or More 
Paranoid Schizophrenia 15(15) 15(15) 30 
Manic 3( 7) 10( 6) 13 
Involutional 5( 7) 7( 5 2 
Simple Schizophrenia 6( 5) 4( 5) 10 
Depressed 10( 5) 0( 5) 10 
Total 39 3¢ 75 
The parenthetical values are the frequencies expected 


from the hypothesis of uniform distribution. 


Only 1 of the 39 + 
resulted in statistically significant 
differences between diagnostic groups. These 
differences are for the CF 
(Table 1), and it seems that the 
absence of the CF score may be 


tests required by 


analysis 1 
Rorschach score 
presence or 
a useful aid 
in distinguishing between patients who are in 
a manic or a depressed phase. Unfortunately, 
the distinctions between patients who are in a 
manic or depressed phase does not often re 
quire the employment of a supplementary de 
the Rorschach Other trends 
are suggested by analysis 1; these were signifi- 


vice such as test 


cant at only the 10 per cent level. For ex 


ample, there is a slight tendency for the mani 

depressed patients in a manic phase to show 
the human movement response, whereas the 
human movement response is shown less often 
than would be fas 


anticipated involutional 


patients Similarly there was some evidence 


that ini 
the FY 


groups 


olutional patients will show more of 
than the 
included in 


responses other diagnostic 


analvsis 1 whereas the 


manic depressed patients in a depressed phase 
tend to avoid the Fy. The results 


show some slight tendency for the involutional 


response. 


patients to make a relatively prevalent use of 


human details. There mav be some tendency 


for the paranoid schizophrenic patients to use 
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relatively more of the whole responses involv- 
ing space than the other patients; there is also 
a slight tendency for the involutional patients 
to avoid the use of whole reponses which in- 
volve space. In general, these trends are con- 
gruent with our beliefs concerning the Ror- 
schach performance of mental hospital patients 
and contribute nothing to our beliefs concern- 
ing the Rorschach test. On the other hand, 
in these analyses there is a conspicuous lack 
of statistically significant trends. This disap 
pointing fact obviously cannot be used as a 
basis for saying that the Rorschach is an in- 
appropriate device for descriptive diagnosis, 
but it is sufficient to warrant extreme caution 
and skepticism in using the aspects of the Ror- 
schach test in question as aids in predicting 
the symptomatic behavior of patients, partic- 
ularly the aspects of the patient’s behavior 
which psychiatrists would employ in a descrip- 
tive diagnosis. 

In order to satisfy the requirements of 
analysis 2 leach of the 39 Rorschach scores was 
correlated with each of the nine symptom 
cluster scores. The correlation coefficients em- 
ploved were either Pearson r’s or the tetra- 
choric correlations.) The shapes of the dis- 
the various 
differ greatly, and it is improbable that the 
sampling distributions for the various correla- 
tion coefficients are highly similar. The cor- 
relation coefficients are based on a sample of 


tributions for Rorschach scores 


117 cases, but these correlation coefficients are 
doubtlessly less reliable than correlation coefh- 
cients based on normally distributed samples 
of this size. For lack of an alternative that is 
both desirable and feasible, correlation coeff- 
cients of .20 are arbitrarily taken as significant. 
Only 3 per cent of all the possible correlation 
coefficients met this arbitrary criterion for sig- 
nificance.' Accordingly, it appears possible that 
the number of large correlation coefficients 
found in analysis 2 is not greater than would 
be expected by chance. 

There is little justification for elaborate dis- 
cussion of these correlation coefficients. Two 
of the Rorschach scores, achromatic color and 
the number of popular responses, show more 
than one correlation with the cluster scores. 
Achromatic color was correlated to the order 
of .23 and .27 with the manic cluster and the 
paranoid condition cluster, respectively. The 
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number of popular responses was negatively 
correlated to the order of .20 with the excited 
cluster, negatively correlated to the order of 
$0 with the paranoid schizophrenic cluster 
and negatively correlated .22 and .21 with the 
paranoid condition cluster and the hebephreni 
or deteriorated schizophrenic cluster, respec- 
tively. The fact that the number of popular re- 
sponses is negatively correlated with some of 
the psychiatric syndrome scores is plausible 


corresponds with the 


implicat 
given popular Rorschach responses. It is 


common 


that the color shock s 
is correlated to the 20 
anxiety cluster. 


( nterest to note 
order of with ¢l 
This is also congruent wit! 
the manner in which color shock is used 
evidence for anxiety and other types of neu- 
rotic defects. The results of analysis 2. like 
those of analysis 1, are not encouraging for the 
use of Rorschach scores as aids for the descrip 
tion of the symptomatic manifestations of psy 
chiatric patients. 


CONCLUSIONS 


The present study, based on newly admitte 
patients to the Connecticut State Hospital 
examines relationships between commonly en 
ployed Rorschach scores and a variety of 
criptive aspects of the behavior of hospitalized 
functional psychotics. The present study con 
prises two parts. The first part compares the 
distributions of Rorschach scores for groups 
of patients with well-known psychiatric di 

The 


scores with scores for empirically determine 


noses. second part correlates Rorschacl 
symptom clusters. 

The present analyses cannot be taken as 
an evidence that descriptive aspects of the be 
havior of psychotic patients may be identified 
by or predicted by a use of Rorschach scores. 
Although the results of the present analyses 
must be limited to the use of Rorschach loca 
tion and determinant scores and to the des 
criptive aspects of mental hospital patients’ be 
havior, it is suggested that caution, if not 
skepticism, be employed in attempts to use 
Rorschach scores as an aid in descriptive diag 
nosis. 


Received February 16, 1951. 
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IMPULSIVITY AS INDICATED BY RORSCHACH 


TEST FACTORS 


RILEY W. GARDNER 


UNIVERSITY 


ANY writers on projective techniques 
contend that major criticisms of the 
Rorschach test as an instrument for 

predicting overt behavior center around prob- 
lems of of test factors. Schafer 
clarifies this argument considerably by pointing 
out that interpretations rather than scores are 


validation 


the proper units for research in projective tests 
and states that the actual need is for validation 


of thesguaintepretations [9, p. 332]. In an 
ae designed to test the supposed 
Rorschach intellectual control, 
Meyer Williams [10] would appear to have 
made a step in this direction. He found signifi- 
cant and reliable correlations between inter- 
pretations of a few selected test factors and 
a quantitative measure of intellectual control 
in an anxiety-arousing experimental situation. 


indicators of 


The present study is intended to be a fur- 
ther step along this line. The general hypothe- 
sis is that rankings of subjects based upon com- 
mon interpretations of Rorschach score factors 
applying to an impulsivity-inhibition continu- 
um will correlate significantly with rankings 
of these subjects by raters from (1) long- 
term acquaintance with the subjects, and (2) 
brief behavior samples, in this case responses 
to the Rosenzweig P-F Test [8]. 

The definition of impulsivity used in this 
study was drawn from Rapaport’s writings on 
the Rorschach test [5, pp. 238-241]. This 
definition is based upon a conception of person- 
ality which equates control with ability to de- 
lay response to external stimulation. If person- 
ality can be said to consist in large part of 
those mechanisms which are gradually inter- 
posed between basic impulses and the environ- 
ment during the course of development, the 
strength and quality of these mechanisms will 
be indicated by the degree of control which 
can be observed in the person’s response to ex- 
ternally determined situations. If affect can 
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be considered, furthermore, a momentary de- 
rivative of original drives and these interposed 
mechanisms [cf. 5, p. 239], the person’s way 
of handling affect, its quality, and its extent 
should be distinctive features of his response 
to any situation. The question investigated in 
this study is, then, whether or not Rorschacl 
factors interpreted by four prominent writers 
on the test (Beck, Rapaport, and Klopfer and 
Kelley) as indicators of impulsivity allow us 
to predict that a person will also be “impul 
sive’ in his everyday life. 

RATIONALE FOR THE INTERPRETATIONS OF 

TEST FACTORS USED IN THIS STUDY 


A survey of the major works of these four 
writers on the Rorschach test [1, 3, 5] 
vided the experimenter with a group of inter- 
pretations generally agreed upon as in some 
way indicative of varying degrees of impul 


sivity. 


pro- 


Color Responses. Rapaport states that the 
pure color response (C) “represents either the 
extreme of impulsive and wild affectivity, or 
an abandonment of all control” [5, p. 242]. 
3eck summarizes his interpretation of the C 
response as follows: ‘“The undiluted color rea 
tion, C, is the test’s equivalent of the 
hibited feeling experience” [1, p. 27]. Klopfer 
and Kelley formulate their interpretation in a 
1 


simi 


ar way: 
Reactions like “blood” or “fire” gjyen indis 
criminately to any red or pink spot in alf ten cards 
like “sky” or “water” to any trace of blue, re 
veal that the last remainders of a rational permea- 
tion have been extinguished in the emotional re- 
actions represented by such responses [3, p. 286]. 


In the present study the C 
were used in three ways (see Table 1): in 
ratios representing (1) the relationship of CF 
(color-form) and C responses to FC (form- 
color) responses, (2) the proportion of CF and 


responses 
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C responses in the total record, and (3) the 
proportion of responses in the total record in- 
volving color. They were also included in the 
culation of the Experience Balance for each 
subject (the sum of the movement over the 
«ym of the color responses). Throughout this 
study color responses were weighted as fol- 

ws: C, 1.5; CF, 1; FC, .5. 

The four writers agree that the CF re- 
sponse is characteristic of a less impulsive re- 
.ctivity than is the C response [cf. 1, p. 28]. 
The FC response is considered a still less im- 
pulsive and more socially conditioned one, 
which indicates the subject’s ability to deal ef- 
fectively with his affects and to modify them 
appropriately in reference to external reality. 
FC responses were used in two ways in the 
sent investigation: (1) in their relationship 
CF and C, and (2) as one element in the 
total color responses as they relate to the total 
imber of responses (R). The first use is 
sed upon the assumption that the FC re- 
ponse indicates much more control of affect- 
ty than does either the CF or C. The 
second is based upon the assumption that re- 

nses involving any amount of color deter- 

lation are more representative of tendencies 
rd impulsive expression of affect than are 
ses determined solely by form qualities 
he blots. 


> = 


lhe Experience Balance. This ratio (sum 
Mf: sum C) is among the best known of all 
Rorschach test factors. Movement responses 
re traditionally associated with internal stim- 
lation, in such forms as may result in fan- 

activity, whereas the various color re- 
onses are associated with outgoing affective 
sponses to environmental stimuli. Of this 
Rapaport writes as follows: 


+ 


taking the movement and color responses to- 
r, it is possible to obtain an estimate of the de- 
f coarctation or dilation of the subject. We 
ilso emphasized that the “direction” of the 
tative wealth is indicated by the relative 
ilence of movements over colors, or vice versa. 
“direction” reflects the relative emphasis 
! by an individual upon thinking and the de- 
ction it implies, or upon free affective ex- 
n and upon the corresponding modes of con- 
f action AS pp. 265-266 1 


\ 


Beck states that this ratio represents ‘the 
balance of introversive against extratensive 


trends, of fantasy living against outward ex- 
pression of the feelings” [1, p. 60]. Klopfer 
and Kelley feel that color responses “reflect 
the reaction to ‘stimuli from without n con- 
result- 


‘“promptings from within’”’ [3, p. 


trast to movement responses, which ; 
ants ol 
205 In this study the usual ratio, sum M: 
sum C, was used to represent the balance be- 
tween movement and color responses in the 


subjects’ Rorschach rec 


Form Level of Color Responses. \t has long 
been recognized that the. degree to which a 
subject is able to maintain a high level of form 
quality in responses involving color is an im- 
portant indicator of the amount of control 
present. Our four writers are in agreement 
upon this point. 

In an attempt to use this relationship be- 
tween form level in form-color responses and 
form level in other responses, a ratio was used 
in this study between the F % of the FC 


esponses in each record and the F % of all 
responses in which forn the principal deter- 
minant. The use of this ratio was in part de- 


termined by the fact that in the Rapaport 
scoring system, used throughout this study, CF 
as well as C responses are not scored with re 
spect to form level. This may well have re 
duced the effectiveness of correlations such as 
those calculated from these Rorschach records. 

Form Responses. If the pure color response 
is interpreted to be the response most indica- 
tive of impulsive, explosive outbursts of af- 
fectivity, the pure form response is generally 
considered the one most indicative of emphasis 
upon “‘intellectualizing” and the most effective 
control in avoiding the disturbing influence of 
affects in the handling of the stimulus material. 
Klopfer and Kelley state, for example, that: 


rhe concentration of a subject’s interest on the 
surely “formal” qualities of the blot material seems 
to indicate the degree to which the subject is in- 
clined to repress or control less “formal,” more per 
mal or spontaneous reactions to the stimulus 
material | 3 203 

Only the percentage of F+ responses was 


used to reflect highly organized form re- 
sponses, without inclusion of responses scored 
F+ in the Rapaport scoring system. Addition- 
al factors were the F% (percentage of each 
total record consisting of pure form responses) 


and Rapaport’s “new F%,” which expresses 
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the proportion of all the responses in an in- 
dividual record in which form is either the sole 
or the principal determinant. 


Other Factors Used in This Study. In ad- 
dition to form and color indicators, four others 
were employed. The first two, the percentage 
of responses listed by Rapaport as “popular’”’ 
and the percentage of animal and animal detail 
responses, were used since they are generally 
interpreted to vary roughly with restriction 
in affective expression and with conformity to 
social norms. The third, the size of R, was 
used in an exploratory sense only, since it is 
the resultant of a mingling of factors, includ- 
ing many which are not the direct subject of 
this investigation. A variation of R was also 
used, on the basis of suggestions in Beck’s 
writings — a ratio between the number of 
sponses to the first seven cards and responses 


re- 


to the last three (none of the former being en- 
tirely colored). 

Thus, an important segment of the litera- 
ture on the Rorschach test yields rather con- 
sistent interpretations of various test factors 
relating to an impulsivity-inhibition continu- 
um. It is the purpose of this study to show, 
ina very limited way, whether these interpreta- 
tions agree with actual observations and rat- 
ings of the overt behavior of individual sub- 
jects. It is granted that each writer quoted 
may be speaking at one time or another of 
potentialities for action, rather than overt be- 
havioral phenomena. In general, however, 
their interpretations are intended to allow 
predictions of the reactions of individuals to 
life situations, and it is in this way that they 
will be considered here. 


HYPOTHESES 

Preliminary hypotheses were: (1) that sig- 
nificant intercorrelations would be found 
among the rankings of the subjects by 
three raters assigned to each behavior sample 
(long-term acquaintance and responses to the 
Rosenzweig P-F Test), and (2) that a simi 
lar correlation would exist between the aver- 
age rankings of the subjects by the two groups 
of raters. 

Specific hypotheses concerning the interpre- 
tations of the Rorschach test factors appearing 
in Table 1 were based upon pre-experimental 
judgments of the degree to which rankings of 
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TABLE 1 
RANK-ORDER CORRELATIONS BETWEEN SUBJECTS 
VALUES ON 12 Test FACTORS AND EsTIMA1 
F THEIR IMPULSIVITY BY Two Groups 
OF RATERS 
Estimates of Impuls 
Based on 
Rorschach Scoring Category Long-Term P-F Te 
Acquaintance Respons 
A. High correlations predicted 
CF + C (including 
C/F) : FC $79 2 
CF +C:R 865 f 
F( CF + C:R 788 f 
Sum M : Sum C .705 42 
F t+- J .549 +1 
B. Fair to good correlations predicted 
} Yo of FC responses 
F % of all form- 
primary responses -.191 -491 
F .342 261 
New F% 277 —.006 
Popular : R .670 g 
| 1d :R 518 f 
C. Low correlations predicted 
nses to Cards I-VII 
VIII-X 349 179 
R .382 482 
the subjects by means of these interpretations 
would correlate with the two criteria (t! 
average ranking by each of the two gro 
raters). These judgments were based largel 
upon the degree of confidence which the four 
writers place in individual interpretations 
the consistency with which these interp: 
tions appear in the writings. Thus, it w 
hypothesized that rankings based upon 
pretations of the factors listed under A 
Table 1 would show significant correlat 
with the two criteria. It was assumed that 
rankings based upon the factors under B 


would 


dependent rankings of the subjects, but w 


show lower correlations with these 


' 
oul 


/ 


nevertheless give some valuable indications in 


respect to impulsivity. 


It was hypothesiz: 


; 


that rankings based upon the factors under ( 
in spite of their mention in the literature 


fair, and sometimes good, indicators of 


pulsivity, 


tions with the criteria. 


PROCEDURE 


would not show significant correla 


Subjects were eight men and two women 





en 
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- only specific criteria applied to them were 
that they be university undergraduate or grad- 
» students (an attempt to gain some control 
ver the level of intelligence) and that they be 
known personally to all three “long-term” 
raters for a period of at least eight months im- 
mediately prior to the investigation. ‘They 
ranged in age from 21 to 27. 


~~ 


[he three long-term raters were graduate 
students in clinical psychology who were se- 
lected because of their mutual acquaintance 
with all ten subjects and because it was felt 
that they were qualified to rank these subjects 

in impulsivity-inhibition continuum. They 

ed Rapaport’s statements about impulsivity 
5] as the basis for their independent rankings. 
[he three raters who ranked the subjects by 
of the Rosenzweig P-F Test records were 
rraduate students in clinical psychology. 
were not acquainted with the subjects 
were given no clues as to their identity. 

ir ratings were made independently. 

Each subject was administered two tests by 

writer: (1) the Rorschach [7], in ac- 
rdance with the directions for administra- 
offered by Rapaport [5], and (2) the 
nzweig P-F Test [8]. The latter test 
sed because it seems to provide, in a very 
period, a good sample of behavior with 
t to the impulsivity-inhibition continuum. 
lhe Rorschach records were scored according 

Rapaport’s suggestions and then rescored, 

th a careful cross-check being made to in- 

iniformity. The cross-check was used to 

Hset any influence the examiner’s personal ac- 
ntance with the subjects might have had 
The Rosen- 

r P-F Test records were given to each of 
he three raters allotted to them in their or- 

|, unscored form and without any identi- 

r marks. They were arranged in an order 

ed upon the previous rankings of the sub- 


ts by the long-term raters in such a way as 


is scoring of their responses. 


t Mage 
id influence of order upon the new rank- 


Values for the twelve factors listed in Table 

re then extracted from each Rorschach 
1 and the ten subjects were ranked ac- 
ling to the interpretation of each factor, 
the order of impulsivity indicated by the 
tive strength of that factor. Spearman 
nk-order correlations were computed for the 


‘various determinants in. the _ blots. 


relationship between each of the twelve rank- 
ings and the two criteria. The significance of 
these correlations was determined on a scale 
derived from the formula for ¢ in small sam 
ples, described by Lindquist [4, p. 211] and 
Garrett [2, p. 346] 


which does not assume 


normality of the sampling distribution 


RESULTS 


The subjects displayed a wide range of re- 
sponsiveness to the series of ink blots and to 
Total 
number of responses (R) varied from 15 to 
105. The Experience Balances of the subjects 
were: 4/0, 5/1, 10/3, 18/6.5, 5/6, 11/17.5, 
6/16.5, 1/4, 2.5/10, and 1/5. The Fo (per- 
centage of R in which form was the sole de- 
terminant) ranged from 46 to 73 

Intercorrelations between rankings of the 
subjects by the three long-term saters were 
600, .769, and .346. Inquiry revealed that 
the low correlation, .346, between two of the 
long-term raters resulted largely from a 
marked difference of opinion about one sub- 
ject. Between the three raters ranking the 
subjects by means of the Rosenzweig test, the 
correlations were .830, .855, and .721. The 
correlation between the average rankings by 
the two groups of raters was .815. (In this 
study the .01 level is .765, the .05 level is 
632.) 

In the first hypothesis about interpretations 
of the test factors it was stated that the fact- 
ors under A in Table 1 would have high cor- 
relations with the average ranking by each set 
of raters. This appears to be borne out for the 
most part in the case of the long-term raters 
Rosen 
The factors listed under 


the second hypothesis seem not to fit our ey 


but to be less so for the raters of the 
zweig test records. 


pectations, with the possible exception of the 
Popular:R ratio and the 4 {d:R ratio. 
The former slightly exceeds the .05 level in 
elationship to the long-term raters and ex 
ceeds the .01 level with the raters of the 
Rosenzweig test records. The latter achieves 
the .05 level only in relationship to the raters 
of the shorter behavior sample. As was ex 
nected none of the factors listed under the 
third hypothesis approached significance. Thus, 
the first and third hypotheses were fairly well 


substantiated by the results: the second was 
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not. Among those factors which show signifi- 
cant correlations, as might be expected, the 
long-term ratings appear to be generally more 
satisfactory than the ratings of brief behavior 
samples 

CONCLUSIONS 

Two general conclusions arise from this 
study: (1) this type of investigation seems to 
have some value in determining the validity of 
various interpretations of Rorschach factors; 
and (2) there is serious need for careful de- 
termination of the predictive powers of com- 
monly accepted interpretations. Although in 
this limited study the Rorschach test seems to 
contain several factors which have predictive 
value for the overt behavior of individual sub- 
jects, other factors interpreted with similar 
confidence do not correlate significantly with 
behavioral criteria. It is true that in clinical 
practice these factors are not interpreted sing- 
ly, but in clusters. Much of the test’s useful- 
ness, however, depends ultimately upon the 
validity of interpretations of individual factors 
which make up these clusters. 

As might be anticipated, rankings based up- 
on considerable day-to-day acquaintance with 
subjects correlate better with test interpreta- 
tions than do rankings based upon a brief sam- 
ple of behavior such as the Rosenzweig P-F 
Test. The correlation of .815 between the av- 
erage rankings by the two groups of raters sug- 
gests, however, that the P-F Test when used 
in this way offers a surprisingly rich sample of 
behavior with respect to an impulsivity-inhibi- 
tion continuum. 

More specifically, those factors appear to be 
rather good for predictive purposes in which 
quantitative aspects of various color responses 
are placed in ratios with R, or in which the 
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more strictly color responses are placed in ratio 
with responses determined by both form 
with form primary. 
Balance and the F+% 


good 


color, The Experie: 
may also be rather 
indicators as they are generally inter 
preted. Other promising interpretations 
those concerning the percentages of pop 
and animal responses, their size being said t 
vary inversely with the degree of impulsivit 
Such FC 
sponses; F+9% of all form-primary respon 
F%; new F%; R; and a variation of R 
found to be very poor indicators for the r 
tive degrees of impulsivity characterizing 
subjects of this study. 


however, as F+% of 


tactors, 
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EXPERIMENTALLY INDUCED FAKING Ol 


ler TAT RESPONSES *’ 


= EDITH A. WEISSKOPE 


AND 


vs JORGE J. DIEPPA 


PURDUE 


HEN an individual is given a projec- 
tive test, the outcome is frequently of 
- vital future. 
Vhether or not he is selected for a position 


importance for his 
depend, in part, on his performance. If 
ants the position, he may make a conscious 
t to give responses which make him ap- 
the kind of person most suited for the 
. oarticular occupation. On the other hand, a 


+7 son who wishes to be discharged from the 





forces may attempt to respond in such 
is to appear a bad prospect for adjust- 
litary service. It is important 


~~ 
ore, that projective techniques should be 
tant to such l ‘The 
of Fosberg indicate 
requirement. 


Shavzin [2] 


Listortive 


¢. 4 


Maiingering. 
| seems to 
fulfills this 
Carp and 


in that Rorschach 


| tT 
the Rorscha 


hand 


' 
+ 


e other 
can be 
Meltzoft [6] 
completion test indicates that 
subject to distortion. 


responses 
Similarly, a study by 
sentence 
st test may be 
as the TAT can be e 


t to faking to a higher degree than a 


tes sucn xpected to 
h as the Rorschach. This expectation 
/ ° . ° 
ed on the impression that subjects are 

1 on the that subject 


‘e of the principles of interpretation 


al of the Chief Medi 
t of Medicine and Surgery, 


lished with the approy 
ector, Departmer 


ns Administratior 
for the 


who assumes no responsi- 
opinions expressed or conclusions 
he paper is based on a thesis presented to the 
ite School of Purdue University, Lafayette, 
na, by the junior author in partial fulfillmen 
requirements for an M. S. degree in psychol- 
The study was directed by the senior author 
This study was reported at the annual meeting 


Chi 


he Midwestern Psychological Association 
April 1951. 
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UNIVERSITY 


with the TAT than with the Rorschach 
It is the purpose of the present study to test 
the following hypothesis: Subjects can influ 


ence the evaluation of their personalities based 


on TAT stories by trving to make a “good” or 


“bad” impression on the examiner. An ex 
amination of some of the methods used by the 
subjects to create a good or bad impression is 
a further objec tive of tl paper 
SUBJECTS 
The subjects were 24 hospitalized male 
veteran diagnosed as psychoneurotic. [he 


irotic instead of 


reason I 


‘ t= } 

so-called ‘“‘normal’’ subjects was their avail 
Lit: - ¢} } ' t\ 
ability to the junior author in fis capacity as 
) } ] y ' 
Psychol t intern at Veterans Hospital. 


The subje 


with a mean age of 34.1] 


23 to 48 years, 


ts’ ages ranged fron 


Dhre ards of the TA] series were ad 


| he ( ards 


ministered to each subject were: 
No. / boy with a violin; No. 8BM, a boy 
n the foreground, two men performing a sur 


gical operation in the background; No. 1/3 MF 


a man standing, a woman with bare breasts 
lying in bed. 
Ihe rationale for the selection of these pic 


tures was as follows: (a) It was attempted to 
select pictures which elicit a relatively high 
amount of projection. According to a previous 
study by Weisskopf [7 
tioned pictures are among the ten pictures in 
the TAT series of elicit the 
highest amount of projection. (b) It was at- 


the three above-men- 


J 


twenty which 
tempted to select pictures which elicit material 
on a variety of adjustment areas. The above- 
mentioned pictures usually contribute to the 
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diagnosis in the areas of level of aspiration 
(Cards /, 8BM), relationship to the parents 
(Cards /, SBM), aggression and sadism 
(Cards SBM, 13A1F), heterosexual impulses 
(Card 1/3 MF), etc. (c) It was attempted to 
select pictures which would elicit stories most 
likely to be distorted when the subject tries to 
make a “ovood”’ or “bad” impression. This 
would be expected with pictures eliciting 
stories on such personality characteristics as 
level of aspiration or aggression which are 
strongly regulated by social mores, either 
through encouragement or through taboos. 

It may be objected that the authors “loaded 
their dice’ by selecting pictures which can, 


more than others, be expected to lead to cor 


roboration of the hypothesis. However, such 
procedure seems permissible if, in case of 
sitive results, it is kept in mind that the lack 

‘foolproofness” has only been demonstrated 

three pictures and that other TAT cards 
be more resistant to faking. 

Three sets of instructions were used in ad- 
ministering the pictures. One of these consist- 
ed of the traditional instructions as suggested 
by Murray. This set will be referred to here 
after as the Neutral instructions. 

Another set of instructions contained the 
following suggestion: “This time I want you 


1 


to tell your stories in such a way that, when 


I hear them, I will receive the dest possible 


impression of you.”’ These instructions will be 
referred to hereafter as the Best instructions. 

The third set of instructions was formulated 
as follows: ““This time I want you to tell 


vour stories in such a wavy that when I ! 


them I will receive the worst possible impres- 
sion of you.” These instructions will be called 
the Worst instructions in the present discus- 
sion. 

All subjects were first tested with the three 
pictures under Neutral instructions. Then the 
following explanation was given: “This test 
can give information on what kind of a person 
you are. But you can fake your stories so that 
I may receive a good impression or a bad im- 
pression of you according to your wishes.” 

After this explanation, the three pictures 
were given to one half of the group of subjects 
under the Best instructions first and under 
the Worst instructions second. The pictures 
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were administered to the other half of 
group under the Worst instructions first a 
the Best instructions second. The Neutral 


} 
+ 


structions were administered first to ea 
ject because it was assumed that it woul 
dificult to obtain a truly spontaneous 

|’ story after the subject had been ex 


to the Best or Worst instructions. 


In the following discussion, three 
| obtained from the same subject by the 


‘uctions, will be referred to 


cols were distributed among t 
ned and experienced in the 
TAT. Each judge rec 
ols, ft 1S, one protocol from ea 
The protocols received by each }j 
1! an equal number obtained by 
tral, Best, and Worst instructions. 


Moreover, each judge received the s 


, ° 
er of Jest protocols produced prior 
» Vf 


» 
Vorst protocols is Best pr tocols p! 


the Worst proto¢ ols. The proto ols 


1 } 


judge were arranged in random « 
numbered from No. 1 to N 
ler was different for the three 


1 


lzes were asked to rate each 
indicated by the number of tl! 
rv high, high, average, low 


a } a : 
spect to the following 


Over-all adjustment 

a. Total hostility (repressed and conscious 

b. Conscious control of hostility 

c. Repression of hostility 

d. Guilt about hostility 

a. Total sexual impulse (repressed and 
scious) 

b. Conscious control of sexual impulse 

c. Repression of sexual impulse 

d. Guilt about sexual impulse 

Level of aspiration 

Eagerness to conform 

Tendency to help others 

Tendency to be helped by others 

Intelligence 

Spontaneity 


It was stated in the instructions that rating 
could be omitted if no evidence was found | 
the protocols. 

The judges, when rating the subjects, 
10t know that part of the protocols had bh 
obtained by instructions different from th 











allel 


ditional 
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ones. However, each judge re- 
ed a sealed envelope with the instructions 
t it should not be opened until after the 
es had The following 
‘The 
tocols you have just read were obtained by 


(A 


ription of the three types of instructions 


been completed. 


t was enclosed in the envelope: 


following three different instructions: 


given.) Go over the protocols again and 
k each one Neutral, Best, or Worst accord- 
to the instructions which you think pro- 
1 the story.” 


RESULTS 

n order to be able to manipulate statistical- 

the qualitative ratings on the rating sheets, 
erical values were assigned to these ratings 

follows: 


Very high 5 

Hich 4 

Average 3 

Low 2 

Very low - 1 
Analysis of variance was used to evaluate 
data. Three variables were considered: 
tions, judges, and subjects. Only the 
Its relevant ro the hypothesis will be re- 


1 here, namely, those pertaining to the 


t 


tions in instructions. 


Variations in instructions produced a dif- 
e statistically significant beyond the 1 
ent level in the ratings of overall adjust- 
t, total hostili nd eagerness to conform, 
difference significant between the 5 per 

nt and 1 per cent level in the ratings of total 
| impulses and spontaneity. The ratings 


the other questions were not significantly 
ted by variations in instruction. 

It appears that the subjects were rated as 
vell adjusted 


Ty 


n, or stronger se’ 


more hostile, less eager to 
ual impulse and more 
taneous when the ratings were based on 
Worst stories than when they were based 
the Best and Neutral There was 
for the Neutral ratings to be be- 

1 the Best and Worst ratings, but con- 

2 


stories. 
ndency 
ibly closer to the Best ratings. 

he latter finding is consistent with the 
Its of the judges’ attempts to identify the 
tral, Best, Worst The 


‘s were able to label correctly 79 per cent 


and protocols. 


the Worst protocols, but only 58 per cent 


AT RESPONSE‘ | 
of the Best protocols and 
Neutral protocols Lhe 


i 
the per cent dentifications of Worst 


58 per cent of the 


ditterences between 


proto ols ind o} Best Oo! Neut il protocols is 
significant beyond the 1 per cent level. ‘hus, 
the Worst protocols were characterized by dis 
tin tive features to a higher degree than the 


Best and Neutral pr itocols 


Ihe follow ing quotations trot the protocols 


oft a subject illustrate qualit tively ome ot the 
sults of this stud espe ll the increased 
1g sion id sex impul elicited by the 
W t { tio lhe i ple il 0 il] S 
} 
t t f f the Neutr | stories tend t he more 
| 
SImiis I to the Be f Tories tl im ft thy Wor t 
stor 
The subject told the following Neutral 
story about Pict re V7 i 
Well, it looks to me like this woman is sick or 
something and that is her husband there, well, looks 
awful downcast because of her conditio That is 
about all I can see out of it. Mavbe she will cet 
better. mavbe she won't That bout all I can 
make of it. That all 
, - , 
Here is the Best story, told by the same 
ject to the 
I would say that this woman had been sick with 
some illness for, »h like pneurlrT i somethi 
like that. Has been between life and death for 
i 
long time and this man has been doctoring her or 
perhaps the books on the table could represent a, 
one could represent a Bible He could have been 


reading or administering to her. But finally she 


has come out of a coma or something and he knows 
she is going to be all right and is weary from watcl 
ing over her face so long and now is either thank 
ful, or praying, or something. Outcome is like it 
is and she is going t et bette 

Finalh the Worst sto 

Well, let’s see. Well, the only thing wrong is 
the man’s hand over his face. He would be lear 
ing over the woman and would get the im 
pression that this woman is a prostitute. The woman 
seems to be dead and the man has killed her and he 
is sorry for his deed. He killed her and now is 
sorry that he has done it. That would be all I can 


think of 


Another subject carries the desire to appear 
“asexual” when trying to make a good impres- 
Here 1S h Ss 


sion to the extreme. Rest story 


about Picture /3MI 


This is a story of a great love of a young lady 
Her beauty and enchantment this voung fellow has 
admired since he was a boy. Having been reared 








together, his love grew for her as they grew older. 
She in return returned this love although retaining 
her virginity. They waited long for the day of 
their parents’ consent to be married. Their life was 
moral, decent, both members of the church helping 
those in the community who were less fortunate. Oh! 
the world looks so good in the future of these young 
people! They are married finally. They have a 
church wedding, friends, mothers and fathers at- 
tended. Having a wonderful time. They leave 
church to go on their honeymoon. This young fel- 
low who has been shy although he possessed great 
love for the lady, when they went to bed he found 
it was impossible to go to bed because he felt that 
something was being destroyed. He is immature, 
unable to achieve the happiness of sexual desire 
which are normal for a man and wife. Their life 
has disappeared. He realizes that they are headed 
for the rocks. 


QUALITATIVE COMMENTS 

A qualitative study of the protocols throws 
some light upon the mechanisms which the sub- 
jects use to make a favorable or unfavorable 
impression. 

A comparison of the Best or Neutral proto- 
cols with the Worst protocols suggests the 
tentative interpretation that many subjects at- 
tempt to appear socially acceptable in the Best 
or Neutral stories by introducing defenses 
against the socially unacceptable impulses 
shown in the Worst stories. The defenses used 
are such well-known mechanisms as sublima- 
tion, projection, reaction-formation, rationali- 
zation, displacement, etc. 

One subject, for example, tells the following 
Worst story to picture 8BM: 


Well, I would say that this was a young man 
given in two different ages. Almost like a dream. 
He has dreamt for a long time to bring the ending 
of another person, possibly his own father and that 
is what he has been doing for some time. He has 
been carrving this dream for some time and did not 
fulfill his wish until late in life when his father 
was old and decrepit. He is being helped by a 
friend, on the perverted side himself, who is only 
there to witness the pain and anguish of the man. 
I see in the future that he will be able to get along 
with society and the law but he will suffer himself 
because of his conscience in later life. I see him 
only as a murderer. 


The Best story, told by the same subject to 
the same picture, runs as follows: 


Well, I see this boy is dreaming and he has just 
witnessed for the first time—his father who is a 
doctor has allowed him to witness an operation. I 
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see him dreaming and hoping that in the futur 
he will have the hand of his father that he may 
able to prolong life as he has seen it up t 
ne. He may not have any appreciatior 
father until he has seen him now to know the tr 
lations while the patient is under the knife 


operation may prove unsuccessful and the b 


came confused as to whether he will be a 
withstand the shock of failure that may cause d 
whether the reward of lengthening the 
some will be worth the pain of failure wi 


ose a Case 


A comparison between the two stories sug 
gests the following interpretations: 

The crude sadistic activity in the Worst 
story is sublimated into surgical activity in t 
Best story. The hatred against the father in the 
Worst story is not apparent in the Best story. 
Instead, the hero desires to imitate the father 
to make the father’s ideals and skills his own. 
This mechanism is known as identificat 
Thus, the transition from the Worst to the 
Best story occurs through sublimation 
identification. This transition resembles the s 
cialization process of individuals: the antago 
ism against the parent of the same sex is b« 


resolved by identification with parental ideas 
and unacceptable impulses are directed into 
constructive channels. 

The perception of Picture 8BM as an ag 
rressive scene under the Worst instructions 
and as a surgical operation under the Best | 
structions appears frequently in the protocols 
of this study. 

Another subject tells a Worst story about 
Picture 8BM to the effect that the boy in t! 
foreground shot the person who is being 
erated on; the boy hopes this hated person w 


luring the operation. In the Best stor 


the boy daydreams that he is a surgeon w! 
performs an operation. In the Neutral stor 
the boy in the foreground also daydreams 
the operation in the background; this tim 
however, he is not the surgeon, but he fears 
that such an operation might be performed on 
himself by other people. Again, as with the 
previous subject, the Best story can be inte: 
preted as a sublimated modification of the 
Worst story. The Neutral story, on the other 
hand, may be a projective modification of the 
Worst story, since the hero projects his aggres- 
sive tendencies upon other people and fears 
their retaliation for his own hostility. 

















In many cases the Best or Neutral story can 
be conceived as a reaction-formation against 
the antisocial impulses disclosed in the Worst 
story. 

lhus, a subject tells the following Worst 
tory to Picture 1: 


This is a young lad who has stolen a package 
hat he found in the hallway of an apartment. This 
ige later turned out to be a Stradivarius vi- 
and several musical scores which were in the 
He later, not knowing the value of the violin, 

a hammer and smashed it to pieces. That 

ts me when I say that the Stradivarius was 


smashed to pieces 


In the Best story of the same subject the 
hero shows extremely positive rather than ex- 
tremely negative feelings towards the violin: 


iis boy is about 10 years of age. He has been 

1 child prodigy since the age of four. He has 
been given a violin by his old teacher, the 
being a Stradivarius. Although this lad is 

ten he fully appreciated the value of such a 
iutiful instrument of music and he resolved never 
this violin part from him and also that he 

d become the world’s foremost violinist which 


later did. 


lhe same kind of reaction-formation can be 
found in the Neutral story: 


his is a young musical prodigy who is probably 

ister of the violin, and who has just laid his 

down across his music score and is gazing 

he violin with adoration or reverence. The 

tcome of this, he will continue being a musician 
bably ending third fiddle on some symphony. 


In the following illustration a comparison 
between Neutral and Best story suggests the 
se of displacement in addition to reaction- 
formation. The Neutral story to Picture 88M 
reveals overt aggression against a parent: 


I think the story in the picture happened around 
1890’s because the tie that this gentleman has 
the one holding the lantern, and the collar of 

s shirt. I should say that the operation is being 

performed because of a wound and this wound has 
een inflicted. I believe it is a mother by looking 

t the little breast. I cannot tell. I imagine that 

this young fellow, the son, has had an unhappy 
home life and has brought about family difficulties 

f some sort and this young fellow he mulled over 

the proposition of what happened at home without 
thinking, I should say, took that gun and decided 
murder father or mother. His attempt was un- 
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successful because they took the parent to the 
doctor who performed an operation, who is going 
to save the life of the parent. He has a look of 


regret and is wondering what the outcome oi the 


operation will be because in turn it will determine 
his future life and his rrow has caused him to 
realize that he should t have acted as he did 
At the end the parent returned to normal hea!th 
but there will always be doubt his tellow’s 


mind because of what he has done although the 
parent is willing to forgive. This is a case of an- 
other frustrated life. 


In the Best story, the hero is strongly de- 
voted to his mother: 


Well, unhappiness has been brought to the life 
of this young fellow. We find that his mother has 
been shot and he realizes, and knowing the person 
that committed the crime but not knowing the reason, 
wonders what action he should take. If his mother 
could get well and would tell him why, it would 
give him ground to decide what he should do 


He has made up his mind to shoot the person that 


has shot this mother if she ts in the ght. He is 
worried. “Oh, if she would only get home. If she 
would get well and tell me.” He is praying that 


God will bring his mother around. We hope that 
this young fellow’s prayers are answered. Not for 
the reason that he in turn can take this drastic 
action against the assaulter of his mother but that 
he will have the love of his mother. Here he is 
at twelve without a mother to keep him through 
the years of adulthood. She will come out well and 
will help him until he is twenty-one and can mar- 
ry a girl that understands him 


Here the subject’s concern for his mother 
can be interpreted as a_ reaction-formation 
against his aggression. However, the hero also 
plans to act out his aggression in an overt man- 
ner, but this time, it is displaced from his 
mother to a criminal. Thus, the hero is in a 
similar situation to that of a censor of por- 
nographic art, who can satisfy his voyeuristic 
tendencies by fighting voyeurism. 

In many cases antisocial impulses which are 
overtly expressed in the Worst story appear 
disguised through rationalization in the Best 
or Neutral story. For example, the hero of the 
Worst story may hurt or kill an enemy in- 
tentionally, while the same enemy may be 
killed by the hero accidentally in the Best or 
Neutral story. 

The following stories may serve as an illus- 
tration. In the Neutral story to Picture 8BM 
the subject does not specify whether the hero’s 
father was shot accidentally or intentionally. 
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Well, the impression I get of a youngster that 
probably went hunting with his father and shot 
him and has a dream that they are operating on the 


father. Probably worried as to the outcome 


In the Best story an accident is the cause of 

the shooting: 
Looks like a yor 

hur 


vision 


father have bee 
Has 
ut 
the 


man and 


ting and he accidentally shot his father 


on his mind or dream wondering as to « 
eame of the 


tat 


operation on 


being performed 


} 


her. Whether it is going to be good or bad. 


The Worst story, finally, shows overt 


ag- 
gression. Here, the shooting is done intention- 


ally: 


Could be that the boy intentionally shot his father 


and maybe dreams that if the bullet doesn’t do the 


job right, maybe the doctor’s knife will slip. That 


is supposing it would be a gun in there, is it? 


The fact that not only the Best stories but 
also the Neutral stories appear to be censored 
modifications of the Worst stories lends sup- 
port to Balken’s and Wyatt's statement that 
stories of the ‘(Thematic Apperception type re- 
flect mainly phantasies which are somewhat 
acceptable to the ego [1, 8]. 

SI 


MMARY AND CONCLUSIONS 


1. Subjects can, when trying to make a 
good or a bad impression, successfully influ- 
ence the diagnosis of their personalities made 
by experienced TAT interpreters on.the basis 
of their responses to three TAT pictures. 

2. When trying to make a bad impression, 
the subjects tended to change their Neutral 
records to a higher degree than when trying 
to make a good impression. From this observa- 
tion the tentative conclusion can be drawn that 
individuals taking the TAT may be attempt- 
ing to make a good impression on the examin- 
er, even if not explicitly required to do so by 
the instructions. 

3. The Best and Neutral stories can be in- 
terpreted as reflecting defenses such as sub- 
limation, projection, reaction-formation, and 
rationalization against the antisocial impulses 
expressed in the Worst stories. 


SUGGESTIONS FOR FURTHER RESEARCH 


1. Since the manner in which people at- 
tempt to make a good or bad impression is de- 
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pendent on the moral code of the group 
which they live, the results of a study suc! 
the present one can be expected to vary wit! 
iltural group from which the subject 
ected 


ted. 


In the present study the subjects 


tempted to make a good impression by appx 


ing less hostile, less spontaneous, less sexual 
and more conforming than they actuall) 
In a less conventional group, for example 
tempts to make a good or bad impression 


be carried out in an entirely different man: 


TI the technique used in this stud) 
help to obtain information about the concepts 


of morality and respectability in specifi 


groups. 
. , . , ; 
4 From a qualitative inspection of th 


records it seems probable that a comparison of 


an individual’s Neutral, Best, and Worst pro- 
0 | be 


t would of great diagnostic value. 


Such-a comparison might be more informative 
regarding the individual’s impulses, aspira 
tions, and defenses than the customary insp 

tion of neutral records. Thus, the methods 
used in the present study might be developed 


into a valuable diagnostic tool, which could 
supplement or replace the traditional TAT. 

Re. CIVE 1 
Early 


1951 


August 6, 
publication. 
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OR years, psychologists in the New 
York City Board of Education, like 
those throughout the country, have used 


the Revised Stanford-Binet as the basic tech- 
nique in measuring general intelligence. Al- 


yugh they have found it a most useful in- 
rument, they have felt that it is very heavily 
weighted with verbal items and has yielded an 
lequate measure for many children from 
lturally limited homes. Further, the test has 
t lent itself too well to diagnostic analysis 
ilities because of the diverse types of items 
ifferent age levels. 

When the Wechsler Intelligence Scale for 
( lren was constructed, the psychologists 
‘ eager to try it out, for it seemed to an- 
many of their criticisms of the Binet. It 
led items to measure similar abilities at 
re levels, permitted more opportunity for 
nostic analysis, gave a performance rating 
well as a verbal and full-scale rating, and 
material that was appealing to children 

| simple for the examiner to administer. 
Before accepting an instrument like the 
WISC to take the place of the Binet, however, 
psychologists felt that not only would it be 


T} 


he writers would | 


like to acknowledge the as- 
ce of the following psychologists who partici- 
| in the testing program: Nathan Altrowitz, 
r Berger, Myra Z. Goodman, Louis Granich, 
Griswold, Henry Hansburg, Georgiana T. 
iry, Marion Jenkins, Judith I. Krugman, 


ry 


rothy B. Lee, Louise B. Lowenstein, Edward 
s, Esther Mullen, Josephine McGurk, Bertha 
son, Bernice Plotkin, Morris Speevack, Lillian 
w, Thelma Voorhis, Hazel Wertman, and Mar- 
t Williamson. Mr. Jacob Abramson, of the staff 
Bureau of Educational Research, contributed 
iterially to the analysis of the data. 
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necessary to try it out to substantiate their 
clinical impressions of its values, but they 
would have to study the results in comparison 
with the Binet results. he latter had been 
used as the measuring rod in making place- 
ments for special classes for children with re- 
tarded mental development in accordance with 
the provisions of a state law. 

Dr. Morris Krugman, who was at that 
time chief psychologist in the Bureau of Child 
Guidance, and Dr. J. Wayne Wrightstone, 
Director of the Bureau of Educational Re- 
search, arranged with Dr. David Wechsler to 
administer the WISC for standardization pur- 
poses. Dr. Wechsler was to use those cases 
which fitted into his standardization scheme 
and to return the results of these and other 

ises so that the findings for New York City 
children could be studied and could be com 
pared with the Stanford-Binet results. 

The testing was to be undertaken by psy 
chologists of the Bureau of Child Guidance 
and the Bureau of Educational Research, and 
the results were to be treated statistically by 
the Bureau of Educational Research. 


PROCEDURI 


Schools were selected to give representation 
to some of the varied neighborhoods, socio 
economic strata, and ethnic groups which make 
up New York City. In all, 18 schools in the 
five boroughs were used. The average IQ’s 
on city-wide tests given in the sixth or seventh 
grades in these schools ranged from 88 to 118 
on the Pintner Intermediate General Ability 
Test, with a median IQ of 100 for the 18 
schools. Children were tested at each age level 
from 5'4% years to 1514 years, if their ages 
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were within 1'4 months of the half-year. An 
equal number of boys and girls were tested 
with both the WISC and Form L of the Re- 
vised Stanford-Binet. The order of presenta 
tion of these tests was alternated, the WISC 
administered first in half the cases, the Binet 
first in the other cases. Wherever possible, the 
tests were given on successive days. In a few 
instances children had had Binet examinations 
several months earlier. If these were given less 


than six months previously, they were in- 
cl led. 


In the 1946-1947 school year 21 psycholo 
gists (19 from the Bureau of Child Guidance 
and from the Bureau of Educational Re- 
administered the tests to children in 
the 18 schools. The findings reported here are 
based upon the 332 records returned (166 
boys, 166 girls) for whom both WISC and 
Binet results were available. Of these, 171 
records had been used by Wechsler as part of 
his standardization population [1]. 


search) 


7 Al 
MEAN DIFFERENCES OF S-B AND WISC 


MEAN IQ’s AND SIGNIFICANCE OF 


FuLt Scave IQ’s at EAch Ace Levet AND FOR ToTAL Group 


Age Level "Mean 


sinet 
W IS¢ 


107.26 
96.08 
Binet 


WISC 


111.3 
101.87 
Binet 
WISC 


105.02 
101.07 
§ Binet 


Wisc 


110 
104 
Binet 

WISC 


116 

106. 5 
Binet 
WISC 


111 
105. 


Binet 
WISC 


108.3 
101. 


Binet 
WISC 


107.00 
100.41 
105.33 

98.67 


Binet 
WISC 
14-15 Binet 
WISC 


Total Binet 
WISC 


95.00 
91.25 


108.45 
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TABLE 1 

BETWEEN STANFORD -Bra 

AnD WISC VerBAL, PERFORMANCE 
AND FULI 


CORRELATION 


SCALES 


668 


9 


126 
642 
783 
834 
SSO 
691 
697 
715 


884 
739 


THE FINDINGS 
As a first step in the treatment of the ob 


tained data, product-moment correlations were 


~~) 


SE y D 


1.18 


5.066 
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TABLE 3 


MeAN IQ’s AND SIGNIFICANCE OF NV 
VERBAL IQ’s aT EACH AGE 


SD 


{EAN DIFFERENCES OF S-B AND WISC 
LEVEL AND FoR Tora. Group 


Ave Level N Mean SEy D SE, t P 

Binet 38 107.26 14.90 2.451 . 

9 2 1 Oo; Ql¢ d l 
WISC 38 97.92 13.20 2.170 ).34 4.91 ’ 
Binet 38 111.87 14.30 2.352 ; 

10 + i 

WISC 38 103.97 15.20 2.500 
Binet 43 105.02 11.60 1.790 ; P 
WISC 43 103.28 10.65 1.644 1.19 ah , , 
Binet 44 110.86 15.20 2.137 ae p 1 
WISC 44 106.32 12.15 1.852 ° oat 
Binet 31 116.84 21.40 3.905 7 . inti wat ai 
WISC 31 107.00 14.70 2.682 : , 

10 Binet 29 111.66 16.55 3.129 3.97 1.68 36 0 P 2 
WISC 29 107.69 10.40 1.966 “a 5. . s 
Binet 37 108.35 15.45 2.575 - p 2 

4 x | » UZ 
WISC 37 104.57 11.95 1.975 ' . 

12 Binet 22 107.00 13.50 2.948 sl " — 

WISC 22 104.27 10.50 2.293 mts —_ , . 
Binet 30 105.33 11.65 2.161 : ; 1.49 7 p , 
50 f ) U0 
WISC 30 102.83 12.20 2.263 é 7 
14-15 Binet 20 95.00 15.35 3.521 44 P ' 
> ) 1 9 2 > > 20 
WISC 20 93.00 16.00 3.670 : 4 ! 
tal Binet 32 108.45 15.85 871 . ; ' 
WISC 2 103.37 13.57 745 5.08 0.61 8.46 P< 01 
mputed between S-B IQ’s and those ob- tained mean difterences in IQ at cat h age 
ned on the WISC Verbal, Performance, level. 


Full Scales at each age level, and for the 


total group. These correlation coefficients, all 


f which proved to be significantly different 
m zero at the .01 level, are presented in 
lable 1. 
In general, the correlations between the 
S-B Scale and the WISC Full Scale are higher 
n those obtained between the S-B Scale and 
ther the WISC Verbal or Performance 
Seales. Of these, the Verbal Scale is more 
losely correlated with the S-B than is the Per- 
formance Scale, except at the youngest age 
rels. 
As a second step in the consideration of the 
lata, the mean IQ’s obtained by the pupils at 
h age level were determined. These data 
re presented for the S-B Scale and the WISC 
Full Scale, in Table 2, which also summarizes 
the results of testing the significance of the ob- 





All but one of the obtained differences be- 
tween mean I[Q’s at each age level are signifi- 
cant. As a result, the obtained mean difference 
of more than 7 IQ points for the total group 
is also significant. To be sure, a consideration 
of the results for the WISC Verbal and Per- 
formance Scales, presented in Tables 3 and 4, 
respectively, makes it evident that the obtained 
differences in mean IQ between the Stanford- 
Binet and the WISC Full Scale may be at- 
tributed, in large part, to such differences as 
exist between the Stanford-Binet Scale and 
the WISC Performance Scale. 

At almost every age level, the obtained 
mean [Q’s are above 100, indicating that the 
sample chosen was slightly above average, al- 
though it was hoped that an average sample 
would be obtained by selecting representative 
schools. It is evident that the small number of 


J. I. KRUGMAN, J. JUSTMAN, J. WAYNE WRIGHTSTONE, AND M. KRUGMAN 


TABLE 4 
Mean IQ’s AND SIGNIFICANCE OF MEAN DIFFERENCES OF S-B AND WISC 
PERFORMANCE IQ’s at EACH AGE LEVEL AND FoR ToTAL Group 


\ Mean SD SE D SEp 


107.26 14.90 
95.11 12.25 


111.8 14.3 
WISC ; 98.97 14.0 


Binet 3 105. 11.60 
W LS< 98. 12.0 
Binet 110.5 15.20 
Wisc 101.55 12.40 


9 Binet 3 116.8 21.40 
WiISC 3 104.42 12.25 


10 Binet 111. 16.55 
WISC 103 12.50 
11 Binet 108.35 15.45 
WISC 98. 12.25 
12 Binet 107 13.50 
WISC 95 12.00 
13 Binet 105.3 11.65 
WISC 93.5 10.80 
14-15 Binet 95 15.35 
WISC 91 13.55 


Total Binet 33 15.85 
WISC 


—— REVISED S-B,FORM L: MEAN 1Q*108.5 
-~--. WISC VERBAL ‘MEAN 1Q=103.2 

WISC PERFORMANCE : MEAN IQ= 98.3 
—-—- WISC FULL *MEAN 1Q= 101.2 


PERCENTAGE OF CHILDREN 
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Fic. 1. Distributions of IQ’s on Revised Stanford-Binet, Form L, and Wechsler Intelligence Scale for 
Children. Percentages of 332 children given both tests who attained given IQ levels. 








STAN FORD-BINET AND THE WECHSLER INTELLIGENCE SCALE FOR CHILDREN 


TABLE 5 
NUMBER AND PERCENTAGE OF CHILDREN SHOWING GIVEN IQ DIFFERENCES BETWEEN 
STANFORD-BINET AND WISC VERBAL, PERFORMANCE, AND FULL SCALES 


“Wisc 
Verbal 
% 


1Q Difference 


Higher on S-B 
1-10 points 
11-20 points 
21 or more points 


Total 


Higher on WISC 
1-10 points 
11-20 points 
Total 


No difference 


WISC 
Performance 
N % 


WISC 
Full 





ases utilized at each age level made it virtual- 
ly impossible to secure complete representative- 
It should be noted, too, that the selection 
of schools was based upon group-test results 
which, as many comparative studies have in- 
licated, do tend to give results several points 
than individual examinations. In this 
onnection, too, it may be well to observe that 
lerman’s standardization groups 
howed mean IQ’s above 100 [2]. 
Table 3 presents a similar comparison be- 
een the obtained mean IQ’s for the S-B and 
WISC Verbal Scales. For the total group, a 


ness, 


1 
wer 


generally 


erence of approximately 5 points in 1Q was 

tained between the S-B and the WISC Ver- 

il Scales. This difference proved to be signi- 

nt at the .01 level. Examination of the ob- 

ned results at each age level indicates that, 

the most part, smaller differences appear. 

large measure, the results for the total 

ip may be attributed to the relatively large 

lifferences noted between the test results at the 
lower age levels. 

Table 4 presents a similar comparison be- 
tween the obtained mean IQ’s for the S-B and 
WISC Performance Scales. Not only is the 
mean difference in IQ significant for the total 
group, but the same situation holds true in all 
but one of the instances in which separate age 

els were considered. 

Consideration of the results presented in 
Tables 2, 3, and 4 indicates that the means 
tor the total group are all significantly higher 


on the Revised S-B, Form L, than on any of 
the WISC Scales. At every age level, the 
WISC Verbal Scale mean IQ is closer to the 
S-B IQ than either WISC Full 
Scale Scale mean IQ. At 
ever) the WISC Pertormance 
Scale mean IQ is lower than the WISC Ver 
bal Scale IQ. 

Distributions of the IQ’s on each of the 
WISC Scales and on the S-B are presented in 
Figure 1. Inspection of the data upon which 
the distributions presented in Figure 1 are 
based reveals that the range of results on the 
S-B is greater than that on the WISC Scales 
at the upper levels. The highest 1Q’s observed 
on the WISC were 145 on the Verbal Scale, 
143 on the Full Scale, and 136 on the Per- 
formance Scale, compared with 167 on the 
S-B. Throughout the distribution, the WISC 


mean the 


or Performance 


age level, too 


Performance Scale shows the greatest discrep 
ancy from the S-B. 


The distributions presented in Figure 1 are 
concerned with group results. A summary of 
individual differences in IQ’s obtained on the 
two examinations is presented in Table 5. 
When one compares the S-B results with any 
of the WISC Scale results, one finds that the 
large majority of children obtained higher re- 
sults on the S-B. Thus, on the S-B Scale, ap- 
proximately 76 per cent of the group-of 332 
children were rated higher than on the WISC 
Full Scale or Performance Scale and 65 per 
cent were rated higher than on the WISC 
Verbal Scale. 
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Conversely, a minority of the group ob- TABLE 6 ag 
tained higher ratings on the WISC Scales — CORRELATION BETWEEN STANFORD-BINET IQ aAnp hi 
approximately 20 per cent on the Full Scale DIFFERENCES BETWEEN STANFORD-BINET IQ AND ni 
and Performance Scale, and approximately 32 —" a Sa lenag-~ aaa iaaa 
per cent on the Verbal Scale. pvt Datetenti reel cn : lo 
Analysis of the size of the differences reveals th 
wide variations in individual cases. Thus, for z = 3 FE: er 
the WISC Full Scale, differences ranged from tos tim s eo le 
2 points higher on the WISC to 38 points a a ee =. fr 
higher on the S-B. For the Performance Scale, = 220 a 20 e w 
differences were even wider, ranging from 20 [ es —s n hi 
points higher on the WISC to 54 points high- I 
er on the S-B. 38 363° 1" 449° fi 
On the WISC Full Scale and Verbal Scale, —- a pon O 
approximately two-thirds of the group ob- 2 aa 604°* 564e0 "63600 sI 
tained IQ’s which were within 10 points of 31 5538 77348 691" pe 
their S-B [Q’s, whereas on the Performance 29 683** .630** .683°* 
Scale, on which there were greater differences, il 37 434% .672°* 659%" st 
slightly more than one-half of the group ob- 12 22 468° 448° 15 S 
: . nm 0 ‘ . ’ 13 30 314 -§979%* 438* 
tained 1Q’s within 10 points of their S-B IQ’s. rr po 199 aage ‘249 al 
One-fifth of the group obtained S-B IQ’s 21 : ji pone ra q 
, ; “ Total 332 .546** .625** 533%? : 
or more points higher than their WISC Per- ————_—— . 
formance IQ's, as compared with approxi  «-aigziament Ggsrent from sere ot the 26 lor 
mately one-twentieth of the group on the other 
WISC Scales. Performance, and WISC Full Scale 1Q’s. 
Although the preceding analysis has indicat- These data are presented in Table 6. 
ed that one may expect differences to appear There is a definite tendency for greater dif- t 
between WISC and Stanford-Binet IQ’s, and ferences between Stanford-Binet and WISC 
that the latter would tend to be the higher 1Q’s to be associated with the higher Stanford- t1 
score for the large majority of children, an ad- Binet IQ’s. Not only may this generalization tl 
ditional question may be posed: Will the great- be advanced for the total group of 332 pupils, 
er differences between WISC IQ’s and Stan- but it holds true at almost every age level. tt 
ford-Binet IQ’s tend to be associatéd with In relatively few instances do the obtained cor- 


higher or lower Stanford-Binet scores? In 
order to determine the answer to this question, 
correlations were computed between Stanford- 
Binet IO and obtained differences between 
Stanford-Binet IO and WISC Verbal, WISC 


relation coefficients fail to reach an acceptable 
level of confidence. 

An analysis of the data for the WISC Full 
Scale for the total group of 332 children is 
presented in Table 7, which gives the percent- 


TABLE 7 


PERCENTAGE OF CHILDREN AT GIVEN STANFORD-BiNet IQ Levets SHOWING GIVEN 
DIFFERENCES BETWEEN STANFORD-BINET AND WISC Futt Scave 1Q’s 











Higher on S-B Higher on WISC 











1-10 11-20 21or more 1-10 11-20 No Mean Diff. 

S-B IQ N points points points points points Diff. (S-B— WISC) 
130 and over 28 14.3 46.4 35.7 3.6 +19.3 
120-129 46 32.6 45.7 10.9 4.3 6.5 +10.5 
110-119 83 59.0 25.3 6.0 9.6 0.0 + 8.5 
100-109 84 45.2 23.8 25.0 2.4 3.6 + 4.7 
90- 99 56 44.6 16.1 30.4 1.8 7.1 + 2.6 
80— 89 23 47.8 4.3 34.8 8.7 4.3 + 0.7 
Below 80 12 50.0 33.3 8.3 8.3 


+ 0.2 
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age of children at each IQ level who were 
higher on the S-B or WISC, as well as the 
number of points higher. 

Although the number of cases with IQ’s be- 
low 90 and above 130 are small, the trend 
throughout is consistent in indicating that dif- 
erences tend to increase with increase in IQ 
level. ‘Thus, the mean differences increase 
from 0.2 points higher on the S-B for those 
who had S-B IQ’s below 80, to 4.7 points 
higher on the S-B for those with S-B IQ’s of 
100 to 109, to 19.3 points higher on the S-B 
for those who had S-B I1Q’s of 130 or higher. 
Only children with S-B IQ’s of 110 or higher 
showed differences, in favor of the S-B, of 2? 
points or more. 

Throughout, there were fewer children who 
showed higher ratings on the WISC Full 
Scale than on the S-B. When such children 
are noted, they tend to be found more fre- 
quently at the lower S-B IQ levels, and the 
difference between the S-B and WISC IQ’s 
s generally small. The lower the IQ, then, 
the more likely the WISC and S-B ratings 
agree. 

An additional factor is worthy of considera- 
tion, and may also be expressed in the form 

a question: Is there any relationship be- 
tween chronological age and difference be- 
tween I[Q’s obtained on the two tests? The 
inswer, in this instance, may be determined 
through the use of the chi square technique. 
A cross-tabulation of chronological ages and 
IQ differences between the Stanford-Binet and 


TABLE 8 

Cross-TABULATION OF CHRONOLOGICAL AGE AND 
DIFFERENCE BETWEEN STANFORD-BINET AND 

WISC Versa Sca.e IQ’s 








IQ Differences (S-B - WISC Verbal 








5 7 
on > = 
M L. " Pe ; Tr a) 
: . ® © - @ Buses 
Ne © = Ss ) = = § + 
< = + + | > o 
56 8 5 13 146 10 24 76 
7-8 16 16 13 19 15 8 87 
9-10 12 6 9 10 11 12 60 
11-12 9 15 11 11 6 7 59 
13-15 12 7 11 10 7 3 50 
Total 57 49 57 66 49 S54 332 
x? = 35.228 
02> P> .01 


WISC Verbal Scales is presented in Table 8. 

A significant relationship exists between CA 
and difference between Stanford-Binet and 
WISC Verbal I[Q’s. In general, the younger 
the child, the greater the difference in favor 
of the Stanford-Binet. The results of this ap- 
proach, to be sure, confirm that growing out 
of the data summarized in Table 3. 


TABLE 9 
Cross-TABULATION OF CHRONOLOGICAL AGE AND 
DIFFERENCE BETWEEN STANFORD-BINET AND 
WISC PERFORMANCE SCALE IQ’s 


1Q Differences (S-B—WISC Performanc« 


© 7s 2 ae 
7 5 
k 3 s 3, § 8 8G Q 
gS s« £2 es : : = 
a = ~ 
Sn 2 7 3 18 12 Ie 9 9 76 
7-8 13 10 16 17 7 9 5 10 87 
9-10 8 5 ” 8 10 ¢ 4 il 60 
11-12 5 10 8 10 7 3 x 8g 59 
13-15 7 5 6 12 5 4 6 5 50 
Total 35 37 41 65 41 38 32 43 332 
x? = 36.943 
20 > P> .10 


Table 9 presents a similar cross-tabulation 
of CA’s and IQ differences between the Stan- 
ford-Binet and WISC Performance Scales. 
Analysis of the data fails to reveal a significant 
difference between CA and IQ differences. 
Here, too, the obtained results parallel those 
reported in Table 4. 


TABLE 10 


Cross-TABULATION OF CHRONOLOGICAL AGE AND 
DIFFERENCE BETWEEN STANFORD-BINET AND 
WISC Fu tt Scate IQ’s 


IQ Differences (S-B- WISC Full) 
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5-6 a 10 17 21 24 76 
7-8 24 17 24 x 14 87 
9-10 11 11 16 » 14 60 
11-12 19 8 14 9 9 59 
13-15 9 13 12 9 7 50 
Total 67 59 83 55 68 332 

x? = 35.731 


P< .01 
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Table 10 presents a third cross-tabulation 
—that between CA and IQ differences be- 
tween the Stanford-Binet and WISC Full 
Scale 1Q’s. In this instance, as well as that 
reported for the WISC Verbal Scale, the ob- 
tained relationship is a statistically significant 
one. Here, too, the indications are that young- 
er children will tend to show a larger differ- 
ence, in favor of Stanford-Binet IQ, between 
Stanford-Binet and WISC Full Scale 1Q’s 
than older children. 


CONCLUSIONS 


1. For the total group of 332 pupils tested, 
correlations of .739, .644, and .817 were ob- 
tained between the Stanford-Binet, Form L, 
and the WISC Verbal, Performance, and Full 
Scales, respectively. 

2. Correlations between Stanford-Binet 
and WISC Full Scale IQ’s vary between ap- 
proximately .75 and .90 at various age levels. 
These tend to be somewhat higher than those 
obtained between the Stanford-Binet and the 
WISC Verbal Scale IQ’s, which vary from 


approximately .65 to .90. The latter, in turn, - 
are higher than those noted between Stanford- ° 


Binet and WISC Performance Scale IQ’s,: 


which range from approximately .50 to .75. 
3. The mean IQ’s on the Revised Stan- 
ford-Binet, Form L, are higher than those on 
the Wechsler Intelligence Scale for Children. 
The widest discrepancy is between the Stan- 
ford-Binet and the WISC Performance Scale. 
The WISC Verbal Scale 1Q’s are closer to 
the Stanford-Binet (mean difference 5.1 
points) than either the WISC Performance 
(mean difference 10.2 points) or Full Scale 
IQ’s (mean difference 7.3 points). For the 
332 children between 514 and 15!/ years in 
this study, the mean IQ’s were 108.5 on the 
Stanford-Binet, 103.4 on the WISC Verbal 
Scale, 98.3 on the WISC Performance Scale, 
and 101.2 on the WISC Full Scale. 

4. These obtained differences between 
Stanford-Binet and WISC Full Scale IQ’s 
proved to be significant at all but one age 
level. Differences between Stanford-Binet and 
WISC Verbal Scale I[Q’s tended to be signifi- 
cant only at the younger age levels. Differ- 
ences between Stanford-Binet and WISC Per- 
formance Scale [Q’s are significant at all but 
one age level. It is evident that the obtained 


differences in mean 1Q between the Stanford- 
Binet and WISC Full Scale may be attribut- 
ed, in large part, to such differences as exist 
between the Stanford-Binet Scale and the 
WISC Performance Scale. 

5. The study of individual results showed 
that approximately two-thirds of the children 
obtained Stanford-Binet ratings within 10 
points of those on the WISC Full Scale or 
Verbal Scale, as against approximately one-half 
on the WISC Performance Scale. 

6. There is a definite tendency for greater 
differences between Stanford-Binet and WISC 
IQ’s to be associated with the higher Stanford- 
Binet 1Q’s. This generalization may be ad 
vanced at almost every age level, and for dif- 
ferences between the Stanford-Binet and 
WISC Verbal, Performance, and Full Scales. 
7. The indications are that differences be- 
tween Stanford-Binet IQ and WISC Verbal 
and Full Scale 1Q’s tend to be associated with 
chronological age, in that such differences are 
larger at younger age levels. The same state 
ment cannot be made for IQ differences be- 
tween the Stanford-Binet and WISC Per 
formance Scales. 


DISCUSSION 


In the present study, the WISC gave results 
similar to the Revised Stanford-Binet, Form L 
in the large majority of cases at the lower IQ 
levels, but the discrepancies appearing at the 
upper [Q levels may be considered too large 
to permit the use of the WISC in place of the 
S-B until further work has been done. It is 
true that larger discrepancies are to be expect- 
ed at higher IQ levels than lower ones — Ter- 
man and Merrill report average differences be- 
tween the two forms of the Revised Stanford- 
Binet as 5.92 for IQ’s above 130, as compared 
with 2.49 for IQ’s below 70. In the present 
study, however, the discrepancies between the 
WISC and S-B are much larger (19.3 for 
[Q’s above 130; 10.5 for IQ’s between 120 
and 129). Since the number of cases involved 
is small, further comparisons are necessary t 
see if these findings will be confirmed. 

The question can be raised as to whether 
the discrepancies are due to underestimation of 
ability by the WISC or overestimation by the 
S-B, or to a combination of such under- and 
overestimation. Only widespread work wit! 
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the WISC, comparisons of distributions of 
large numbers of representative groups, and 
clinical findings can provide an answer to this 
question. 

During the period before such data are 
available and before the evidence is conclusive, 
psychologists will and should certainly use the 
WISC, but will still feel the need for check- 
ing with the Stanford-Binet in those instances 
where clinical judgment and WISC results 
seem at variance. 

There is no doubt that the Wechsler In- 
telligence Scale for Children is a valuable ad- 
dition to the psychologist’s clinical techniques. 
The inclusion of performance material as an 


4 


integral part of the Full Scale, the possibility 
of obtaining both a verbal and performance 
rating, and the point scale approach, open pos- 
sibilities for clinical uses that should be as 
fruitful at the children’s level as they have 
proved to be on the adult scales. 


Received March 5, 1951 
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A. COMPARISON OF MASCULINITY-FEMININITY 
MEASURES 


BERNARD F. SHEPLER 


FLORIDA STATE 


HE Franck Completion Test [2], a 
projective method for measuring mascu- 
linity-femininity (hence, called MF), was 

recently introduced to the clinical field. Be- 
cause the initial results looked promising, ad- 
ditional experimentation is indicated. Franck 
[1] suggested that one of the areas needing re- 
search was comparison with other measures of 
MF. The purpose of this experiment is to 
compare the Franck Test with three standard 
measures of MF. 

lhe experimental group was composed of 57 
male and 67 female students, a 100 per cent 
sample of three classes in general psychology. 
Owing to requirements of the University, 95 
per cent of all undergraduate students enroll 
in general psychology. For this reason it is 
felt that the experimental group may be con- 
sidered an adequately random sample. 


Each subject was administered a battery of 
four measures of MF. The four components 
of this batterv were: (1) the MF Scale of the 
Strong Vocational Interest Blank [5], Form 
M for men [6] and Form W for women [7]; 
(2) the Mf Scale of the MMPI [3, 4]; (3) 
the Terman-Miles Attitude-Interest Analysis 
Test [8, 9]; and (4) the Franck Completion 
Test. 

Form M and Form W of the Strong each 
consist of 400 items. Items are checked 
“Like,” “Indifferent,” “Dislike,” or, at times, 
are ranked in order of preference. The Mf 
Scale of the MMPI is made up of 60 questions 
from a total group of 550. Questions are an- 
swered “True,” “False,” or “Cannot Say.” 
The Terman-Miles Test (Form A) contains 
456 items divided into seven subtests. It is a 
pencil-and-paper test of the questionnaire vari- 
ety, and items are checked in a similar manner 
to those on the Strong. The Franck Test is 
made up of 36 incomplete drawings, each con- 


Terman- Exp. §7 62.17 55. 
” 
/ 


UNIVERSITY 


taining a few simple lines. Each stimulus ex 
tends over approximately one square inch, 
located in the center of a cell two and one- 
quarter inches square. Subjects are instructed 
to complete the drawings in any manner that 
is enjoyable. 

The F-test was applied to determine wheth 
er the experimental sample was similar to the 
normative groups used by the authors of the 
various measures. To determine the reliability 
of scoring the Franck, Pearson product- 
moment correlations were computed for both 
male and female groups as rated by three scor 
ers. 

Critical ratios were calculated for the f 
measures of MF to determine the ability 
these measures to discriminate between ma 
and females per se. 

Correlations were computed between the 
four measures of MF for both male and fe 
male groups; correlations between the Franck 
and the three standard instruments were com 
puted to determine to what extent the four 
instruments were measuring the same aspects 
of MF; correlations between the three stand- 
ard instruments were computed for additional 
comparative puposes. 


TABLE 1 
MEANS AND SD’s For MALES AND FEMALES OF 
EXPERIMENTAL Groups AND Test AuTHorR’s Groups 


MF Men’s 
Seale Group N Mean SD 


Women’s 
N Mean SD 


0 67-57.11 45.45 
§ 130-60.81 39.15 


F 
Miles Author 130 67.42 47.6 


Strong Exp. 57 58.15 79.80 67-39.53 
Author 60.6 $81.72 150-43.4 


MMPI Exp. 24.92 5.62 67 34.97 
Author 20.44 5.13 108 36.51 


Franck Exp. 7 16.82 2.96 67 19.79 
Author 14.41 3.39 150 19.61 
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RESULTS 


Means and standard deviations for males 
ind females, experimental groups and test 
ithors’ groups, are presented in Table 1. Re- 
Its of the F-test are not included; all failed 

meet requirements for rejection of the null 
pothesis at the 5 per cent level of confidence. 

The correlations between scorers for both 
ile and female groups are not included, but 
ill were above .90. All were significant at the 
1 per cent level of confidence indicating satis- 
ictorv reliability of scoring. 


TABLE 2 
CR’s BETWEEN MALES AND FEMALES* 


MF Scale CR 


Terman- Miles 
Strong 
MMPI 
Franck 


12.35 
8.68 
10.10 
5.19 


Experimental groups only 


Table 2 presents the critica! ratios that were 
mputed to determine the ability of each of 
four measures to discriminate between 
les and females per se. All four mean differ- 
1ces are significant at the 1 per cent level. 
Correlations between the four measures of 
\fF both males and females are given in Table 
The negative correlations found in Table 3 
lue to scoring peculiarities. A “high” score 
nasculine on the Strong and Terman- Miles, 
whereas a “low” score is masculine on the 
\IMPI and Franck. The opposite obtains for 
feminine scores. 


TABLE 3 


CORRELATIONS BETWEEN MF MEAsuRES 
FOR MALES AND FEMALES 


Franck Franck Franck Strong Strong MMPI 


vs. vs. 


T-M 


vs. vs. 


vs. 
Strong MMPI T-M 


vi 
MMPI T-M 


Males ~.087 .107 -.001 565 -—499 -.656 


Females -.013 -.025 149 .669 -.551 ~.534 


As indicated in Table 3, correlations be- 
tween the Franck and the standard measures 
were not significant. The correlations between 
the standard measures were all significant 
‘rom zero at the 1 per cent level. 


INTERPRETATIONS 

Results of the F-test indicate that the ex- 
perimental sample is similar to the normative 
samples, indicating equivalent randomness has 
been obtained. 

The high correlation between scorers on the 
Franck may be accounted for by the manner 
in which the tests were scored. \Vhenever a 
discrepancy occurred between scorers, a confer 
ence was held and agreement reached 1 
sible. In many instances agreement could not 
be reached, with the result 
ten item 


pe S- 


that as many as 
differences between scorers appeared 
on a single test. When the total score of the 
individual was computed, these differences can 
celed each other out in most instance In no 
case was the discrepancy of the total 
than 2. 


rreater 
Despite the high reliability of scoring, it is 
felt that the scoring manual is in need of re- 
While some items appeared to be 
y simple to score, others were verv dif- 


finement. 
’ 


relative 
Ex- 
amples of ‘typical’ male and female comple- 


ficult due to lack of clarity of directions. 


tions were confusing owing to similarity. The 
seemingly obvious lack of discriminatory abili 
ty of some of the stimulus drawings indicates 
that an item analysis might prove to be fruit- 
ful. 

The data presented in ‘Tables 2 and 3 are 
thought-provoking. From Table 2 we 


the Franck does as adequate a job of 


ee that 
dis- 
criminating between male and female groups 
critical 
for all four measures are significant at the 1 
per cent level. Table 3 suggests that the Ter 
man- Miles, the Strong, and the MMPI are 
measuring many aspects of MF that are com- 
mon to all three instruments. All correlations 
between these three instruments are significant 
at the 1 per cent level. 

In contradistinction to this, the Franck does 
not correlate with any of the three standard 
measures. All correlations are found not to be 
significant. Taking these facts into considera- 
tion, it would seem that the Franck is measur- 
ing entirely different aspects of MF from those 
measured by the other three instruments. 
What these aspects of MF are is unknown and 
beyond the scope of this paper. Further re- 
search is indicated in the light of these find- 
ings. 


as do the other measures, i.e. ratios 
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CONCLUSIONS 

1. Critical ratios between a group of males 
and a group of females indicates that the 
Franck Completion Test has the ability to dis- 
criminate between males and females per se. 
This discriminatory ability is significant at the 
1 per cent level of confidence. Therefore, it 
may be assumed that males and females com- 
plete the stimulus drawings in a different man 
ner. 

2. Nonsignificant correlations between the 
Franck and the Terman-Miles, the Mf Scale 
of the MMPI, and the MF scale of the 
Strong indicate that the Franck is apparently 
measuring different aspects of MF from those 
measured by these three instruments. 

3. Further research is needed to determine 
what these aspects of MF are that the Franck 
appears to be measuring. 


4. Indications are that the scoring manual 


of the Franck is in need of refinement if the 
test is to be used to its fullest advantage and 
promise. 

5. The need for an item analysis of the 
Franck is suggested. 


SHEPLER 


ved March 16, 1951. 
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THE SEMANTICS OF INTELLECTUAL CLASSIFICATION 


RAYMOND CORSINI 


WISCONSIN DEPARTMENT OF PUBLIC WELFARE 


COMMON problem of scientists is 

the conversion of quantitative symbols 

into qualitative ones in some manner, 
to permit those individuals who may be unin- 
formed about the technical meanings of special- 
their import 
In psychological 
practice, this problem is a real and continuing 
ne, and it is this writer’s intention to propose 
: specific, rational solution to one case. 


ized numbers to understand 


through the use of words. 


Intelligence test results commonly are con- 
verted into IQ’s and then into statements such 
as “average,” There 

several reasons why use of such classifica- 
tory terms is preferable to the use of IQ’s, 
entiles, One 
that the use of classification terms implies a 
general “band” of values rather than a specific 
point-value, and, consequently, does not imply 
the precision that a number might. A further 
reason is that a more accurate impression as 
well as a more comprehensive one may be com- 
nunicated to statistically naive individuals 
when general terms are used. As we shall 
show, however, some very real possibilities of 

interpretation can occur, perhaps in unex- 

ted directions, by our usual systems of intel- 
lectual classification. We have attempted to 
present some data which in method and content 
may have value in resolving this problem, re- 
membering that the efforts of test constructors 
and of clinical psychologists come to naught 
f the ultimate user of psychometric data 
should misapprehend them through semantic 
lificulties. 


“dull,” and “superior.” 


standard scores, etc. reason Is 


Let us consider two standard schemes for 
i¢ transmutation of IQ’s into classificatory 
terms, the well-known systems of L. M. Ter- 
man [1] and David Wechsler [2]. Exact 
orrespondence between these two schemes ex- 
ists in two respects: both systems have seven 
‘bands” of values, and the middle portion in 


+} 


both cases is called “average” and contains 50 
per cent of the entire distribution. No other 
agreement exists. But some of the differences 
need not be considered trivial. “Superior” 
means [Q 110 to 120 for Terman, and 1Q 
120 to 127 for Wechsler. The lowest class 
has an upper limit of an 1Q of 70 for ‘Terman, 
and 65 for Wechsler ;' the highest class starts 
with an IQ of 140 for Terman, and 128 for 
Wechsler, etc. [1, p. 79: 2, p. 37 

Both these schemes are admitted by their 
originators .to be tentative, Terman referring 
to his system as “suggested” [2, p. 79] and 
Wechsler stating in reference to his system, 
“some agreement both as to system of notation, 
and interpretation of results, is obviously neces- 
sary. Such agreement could best be achieved 
through conventions established by an interna- 
tional meeting of psychologists and psychia- 
trists’” [2, p. 39]. It is quite obvious that both 
schemes were established by fiat and that each 
represents but one man’s opinion. There are 
several problems that appear: (1) How many 
“bands” of value are optimal? (2) What 
should the limits for these “bands” be? (3) 
How shall they be labeled ? Undoubtedly, each 
of these questions can be settled by “interna- 
tional conventions,”” but a more direct and sat- 
isfactory solution might be to attempt an em 
pirical investigation using the familiar tech- 
niques of psychometrics. In this paper we shall 
discuss a rational solution to the three ques- 
tions. 

THE METHOD 


The 10 terms suggested by Terman and 
Wechsler, as well as 49 others obtained from 
diverse sources, were reproduced on a sheet of 
legal-size paper. Beneath each term a 100-mil- 


limeter line was placed. Instruction sheets 


1Wechsler quotes IQ 65 as the upper limit of “de- 
fective” on page 40 and page 190, but sets it at 67 
on page 227 [2]. 
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were prepared which read as follows: 


This is an experiment to determine what certain 
words, descriptive of intelligence, mean to you. You 
are to rate each word as you understand it. Notice 
that below each word there is a horizontal line 
You are to make a mark on the line under every 
word to convey the impression the word makes on 
you. Consider the line to represent all people. 
Making a mark at the exact center would mean 
“average” or “normal” intelligence, the intelligence 
possessed by the person exactly in the middle. 
Making a mark at a point halfway between the 
center and the right hand end represents intelli- 
gence at the 75 per cent point. Making a mark at 
a point one-tenth of the length of the entire line 
from the lower end means, of 
as represented by people at the 10 per cent point. 


course, intelligence 


To summarize, make your mark where you feel 
it belongs in describing people’s intelligence. View 
the line as a percentage indicator with 0 per cent 
and 100 per cent at the ends, 50 per cent in the 
middle, and 25 per cent and 75 per cent halfway 
between the left end and the center and halfway 
between the center and the right end. 


SUBJECTS 


Five populations of judges of ten individuals 
each were used to attempt to get sufficient 
judges of each type, and a rather wide diversi- 
ty. Group A consisted of intelligent adults, 
acquaintances of the writer; Group B consist- 
ed of inmates of a New York State Prison 
who had been assigned to the prison school as 
teachers of elementary and high school 
courses; Group C consisted of psychologists, 
four being clinical psychologists and six being 
graduate students at Syracuse University; 
Group D was made up of sociologists — two 
parole officers, two probation officers, and six 
social workers; Group E consisted of high 
school teachers employed in the Auburn, New 
York, School System.? 

Each subject’s rating of the 59 terms was 
measured by a 100-millimeter ruler, and was 
posted. Means and mean deviations were com- 
puted for each term for each of the five groups 


2The writer wishes to thank the following people 
who assisted him by obtaining judges: Dr. Herbert 
Sanderson, of the Jewish Community Service So- 
ciety; Miss Regine Seidler, of the Des Moines Child 
Guidance Service; Mr. Larry Argiro, then of the 
Auburn, New York, School System; Mr. Bernard 
Kogon, then of the New York School of Social Wel- 
fare; and Mr. David Guralnick, of the New York 
State Employment Service. 


and for the total group of 50 subjects. Each 
term was ranked, and rank-order correlations 
were made. Because the correlation in each of 
the ten pairs exceeded a p of .99 and because 
by inspection no gross constant differences in 
means occurred between groups, the 50 judges 
were considered to form as homogeneous a 
population as any of the separate groups, and 
were, therefore, treated as one group. 


RESULTS 


The 59 terms contained some that were 
unique and awkward, such as “‘at least average 
intelligence,” “limited intelligence,” “low or 
der of intelligence.’”’ Some of the terms were 
inversions and variations on the same theme 
such as “retarded mental development,” “‘men- 
tally retarded,” “retarded intelligence.” Elim- 
inating these two classes left 23 terms, includ- 
ing the ten suggested by Terman and Wech 
sler. Table 1 lists these 23 terms together wit! 
their mean values and their mean deviations 
for our 50 judges — the terms being arranged 
in rank order. We note some unexpected facts. 
Terman placed “dullness” at IQ of 80 to 9 
and “borderline deficiency” at IQ of 70 to 


TABLE 1 


MEANS AND MEAN DEVIATIONS OF RATINGS 
OF INTELLECTUAL CLASSIFICATIONS 





Classification 


Mean Value Mean Deviatior 





Idiot 2.5 2.6 
5 


Imbecile 8 4.3 
Feebleminded 11.0 6.4 
Defective 8.3 
Moron 8.0 
Inferior 8.1 
Backward 7.6 
Dull 8.3 
Limited 7.9 
Borderline 10.3 
Retarded 7.5 
Low Average 5.3 
Below Average 5.5 
Dull Normal 6.7 
Average 1.7 
Normal 3.2 
Bright Normal 5.7 
Bright 8.4 
High Average 7.7 
Superior . 8.8 
Very Bright 6.4 
Very Superior 5.2 
Genius y 3.6 








THE SEMANTICS OF INTELLECTUAL CLASSIFICATION 


TABLE 2 


MEAN RATINGS AND MEAN DEVIATIONS OF CLASSIFICATION 


Terms 


BY Five PopULATIONS OF TEN JUDGES EACH 


(A, Intelligent Adults; B, Prison Inmate Teachers; C 


A B 
M MD M 


Classification MD 


Genius 95 5.0 98 
Very Superior 91 5.6 89 
Superior 77 9.1 72 
High Average 11.5 68 
Bright Normal 4.0 61 
(Average 0.5 48 
Dull Normal 7.4 34 
Low Average 6.5 31 
Borderline 33 
Dull 9.0 27 
Inferior 9.4 27 
Defective 12.6 10 
Feebleminded 6.8 6.1 
Imbecile 2.4 2.0 


1.8 
8.4 
11.6 
8.3 
7.8 
4.1 
6.6 
7.5 
15.8 
8.3 
8.8 
6.2 
4.9 
4.0 


7.6 
2.3 


80. Wechsler rated ‘‘dull normal” at 80 to 90 
and “borderline” at 66 to 79. Kuhlmann 
placed “dull” at 85 to 94 and “borderline” at 
75 to 84. We find, however, that “borderline” 
has a higher mean value, as far as our judges 

than “dull,” and that “dull” and “dull- 
rmal” have quite different connotations. 
right normal” and “dull normal,” which 
were placed symmetrically above and below the 
nean by Wechsler, are respectively 9 points 
and 13 points away from the “average.” 
Terms near the extremes are more homoge- 
eous than terms near the center. If we take the 
VD of “average” to indicate mechanical error, 
ve find that “idiot” has 1.5 times as much 
iriation, “genius” 2.1, while “moron” has 
47, “superior” 5.1, and “borderline,” the most 
heterogeneous term, has a 7D 6.1 as large as 

erage.” 

Table 2 contains the mean values and mean 
leviations of the ten terms suggested by Ter- 
nan and Wechsler plus four others for our 
ive groups of judges. We note the psycholo- 
gists’ averages exceed the average for the total 
‘roup in eight cases, are exactly equal in two 
ses, and are lower in four cases. By inspec- 
ton, we notice some differences, but they do 

t appear to be in any direction to make the 
tormidable task of analysis of significance 
worthwhile. It is of interest to note that 


, Psychologists; D, Sociologists; 


E, High School Teachers.) 
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M MD ] M 


94 8 m 98 
91 5 : 

80 

67 
62 : 

50 ). 1 

32 8 4.1 
42 7 5 7.0 
28 4 8.5 
32 8.8 8.2 
28 8.0 9.5 
18 6.7 6.7 
14 7.7 5.8 
9.8 5.7 5.9 
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“borderline” has, most probably, a significantly 
lower value for the psychologists than for 
other classes, and this is also the case for “dull 
normal,” which undoubtedly reflects the in- 
fluence of special training, and which implies, 
insofar as these words are concerned in this 
context, that psychologists are using familiar 
words in a special way. 


A NEW SCALE 


Can we compare our rating values with any 
known scale, so that we can give these terms 
empiric IQ values? This is a moot point, for 
although the directions for the judges implied 
a centile scale, whether or not we can so con- 
sider it, is another question. Perhaps we could 
for the psychologists, but inasmuch as our aim 
is not to find what one special group thinks, 
but what a diverse sample of people feels, we 
see no point in using the psychologists alone. 
Accepting, tentatively, the hypothesis that our 
rating scale is a centile scale, and that our 
group mean values are mean affective centile 
values, when we arrange these terms on a 
centile scale and compare their placement as 
suggested by Terman and Wechsler, we find 
in Figure 1 that the values by fiat and the 
values by experiment differ considerably in 
some cases. The area from 25 per cent to 75 
per cent has been reserved by both Terman and 





490 


Wechsler for the range of “average,” but our 
judges place “dull,” “borderline,” ‘‘dull nor- 
mal,” and “bright normal” within this range. 
If we look back at Table 1, we find a total of 
14 terms from “inferior” to “high average”’ 
being included in the established range of 
“average,” and even “superior” comes to with- 
in one point of inclusion! 
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Fic. 1. Centile positions of various classification 


terms as suggested by Terman and by Wechsler 
and as found by empiric ratings. 


If we attempt to create a new system for 
the classification of intelligence using our data, 
we are faced with problems 1 and 2 as stated 
above: how many classes to have, and where 
to place the limits of each class. We could 
use Wechsler’s mathematical scheme, but this 
is unacceptable due to the too-wide range of 
“average.” We could use Terman’s multiples 
of IQ points, but this has no mathematical 
logic. One solution might be to use deciles, 
combining the two mid-deciles into one, and 
having nine “bands” of value. Using usual and 
the more homogeneous terms, and making some 
adjustments, we have Table 3. Although this 
scheme is not presented as ideal, on the basis 
of the information at hand it appears superior 
to Terman’s and Wechsler’s systems in that 
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there is a closer reflection of what people act- 
ually take these terms to mean. 


TABLE 3 
A SCALE oF INTELLECTUAL CLASSIFICATION 
Mean 
Empiric Ciassification 
Value 


Decile 


IQ Range 


121- 
114-12 
109-113 


90 Very Superior 
77 Very Bright 
76 Superior 

67 High Average 105-108 
50 Average 96-104 
ow Average 92- 9 


3 1 
27 I 


ferior 87- 91 


IX 17 Defective R0— 86 


X 1 eebleminded - 79 

This suggested scale places the two extreme 
classifications not at the middle of the deciles, 
but at the inner ends. No good term for the 
second decile appeared in our original 59 
terms. “Very bright,’ although unacceptable 
both in connotation and in value, is the best 
compromise. Giving “average” a range of 20 
centiles is much less generous than what has 
been suggested, but seems to reflect more 
adequately what our judges felt. For “‘feeble- 
minded,” we can only say that, even though 
experts place only 3 per cent of our population 
in this area, our judges were more generous. 

If we are committed to the theory of modi 
fying our classificatory system to fit current at- 
titudes, rather than attempting a fiat system 
which makes arbitrary placements, a scale built 
in this fashion would seem to create less mis- 
understanding between psychologists and the 
public. 

SUMMARY 


We have considered one instance of the 
problem of qualifying quantities, examined 
two standard systems of intellectual classifica 
tion, and found certain discrepancies. The 
proposal that an international convention con- 
sider the matter of establishing a standard 
method is stated to be inferior to obtaining 
empirical data for the solution. By using a 
rating scale, mean values for 59 terms rated 
by five populations of ten people each, identi- 
fied as intelligent adults, prison inmate teach- 
ers, psychologists, sociologists, and high school 
teachers were obtained. Eliminating awkward 
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and duplicating terms, we have 23 terms which 
are further considered. By accepting our 
rating scale as equivalent to a decile scale, we 
find certain gross differences between the 
placement of terms as Suggested by Terman 
and Wechsler, and our empirical results. A 


rational scale of nine “bands” of values, using 


deciles, combining the fifth and sixth deciles, 
is suggested as a closer approximation to sub- 


jective impressions of test users. 
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PSYCHOLOGICAL TESTS OF RESEARCH SCIENTISTS’ 


ANNE ROE 


NEW YORK, N. Y. 


HIS paper reports some special aspects 

of a research project which has been car- 

ried on for the past three and a half 
years. The major part of the project was the 
clinical study of eminent research scientists in 
various fields of science, and a consideration of 
differences and similarities among them in life 
histories and test performances. The full re- 
ports of the study have been published else- 
where [2, 3]. This paper presents certain 
interrelations between the tests used that are of 
particular interest. An earlier paper presented 
similar but less complete data on the biologists 
alone [4]. 

The tests employed were the Rorschach, the 
TAT, and a Verbal-Spatial-Mathematical 
(VSM) test constructed for the purpose by 
the Educational Testing Service, the three sub- 
tests being separately scored. The verbal test 
is an antonym test; the spatial test requires 
identification of drawings of the same object 
seen in different positions, and the mathe- 
matical test is chiefly one of mathematical 
reasoning. The mathematical test was not dif- 
ficult enough for the physicists, so that they 
are not included in any correlations using it. 


The subjects in the total study included 64 
ranking research scientists: 20 biologists, 22 
physical scientists, and 22 social scientists ( psy- 
chologists and anthropologists). For three of 
these subjects there are no test data available, 
and there are some deficiencies in the data for 
the others, as is evident in the tables. These 
subjects were selected for their eminence in 
research, in the estimation of their peers. All 
are men, all but one are American-born, and 
all received the bulk of their training here and 


1Part of a study financed by a grant from the 
Division of Research Grants and Fellowships of the 
National Institutes of Health of the United States 
Public Health Service. 


were still scientifically productive when 
studied. They range from 31 to 60 in age. 
Details on selection are given in the mono- 
graphs, which also contain tables of raw data 
not reprinted here. 


EFFECT OF AGE ON TEST PERFORMANCE 


Correlations with age are given for various 
aspects of test performance in Table 1.? The 
subtests of the VSM are entered first in the 
table. Clearly, only the spatial test shows any 
effect of age, within this age range, but this is 
great enough to be of some importance. 


TABLE 1 
Errect oF AGE ON Test PERFORMANCE: 
CORRELATIONS 


Correlation 


Test Items with Age 


Verbal Test, number right — .108 
Spatial Test, number right 57 ~.397 
Mathematical Test, number right +-.001 


Rorschach: R — 194 
T/R 5 -.15 
W -.068 
M 204 
ZC 61 204 
F% 61 +113 
F+% 61 —.119 
ITS 61 .222 
TAT: length of story 61 —.193 





Not all the Rorschach determinants are ap- 
propriately handled by the usual correlational 
techniques, but a few which seem of particular 
interest, and which have wide ranges and rea- 
sonably normal distributions, have been includ- 
ed. None of these correlations is significant at 
the 5 per cent level (r == .25 for this sample). 

The correlation between age and length of 


2All calculations were done by Mr. Lassar Got- 
kin. 
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TAT story is also not significant. The meas- 
ure of length is a crude one — the average 
number of lines of typescript for the stories for 
each man. These protocols were all recorded 
in shorthand; there are doubtless some omis- 
sions of words or phrases, but the record is 
reasonably accurate. 

It would seem, then, that within this age 
range, and with the exception of the spatial 
test, variability in age does not have an im- 
portant influence on test performance. 


VSM TEST INTERCORRELATIONS 


These correlations, given in Table 2, are 
based on raw scores, i.e., on the number of 
items answered correctly in each subtest. Only 
the correlation between the verbal and spatial 
subtests is significant. This is not materially 
changed if age is partialled out. Descriptions 


TABLE 2 
VERBAL-SPATIAL-MATHEMATICAL TEST 
INTERCORRELATIONS: NUMBER RIGHT 


Verbal Spatial 
Spatial 


Mathematical 39 


of work on the spatial test would confirm that 
it can be done in different ways. This test was 
a rather unsatisfactory one, inasmuch as its 
significance is not clear. The distribution of 
scores was platykurtic, and its correlations 
with other tests (not used in this study) are 
onfusing. 


PRODUCTIVENESS OR OUTPUT 


It can be hypothesized that there is a general 
factor of productiveness or ease and amount of 
output for the various tests, which might be 
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manifested in such ways as the amount of 
work attempted in each test. For the VSM 
subtests these would be the number of items 
which the subject had tried to do (computed 
as the last one marked; if some items were 
omitted before that it was assumed that he had 
read and at least considered them), for the 
Rorschach the total number of responses, and 
on the TAT the average length of the stories. 
The correlations are given in Table 3. Two 
of the VSM correlations are low, but signifi- 
cant, but it is not certain that this is not pri- 
marily a speed factor inasmuch as this test was 
timed and some subjects did not finish all the 
items. But the correlation between verbal 
“tried’”’ and Rorschach R is also significant, al- 
though low. Length of TAT story and num- 
ber of responses on the Rorschach have a posi- 
tive correlation of .499, which is the highest in 
the table. 


RELATIONS BETWEEN VSM AND 
PROJECTIVE TESTS 


Relations between scores on the VSM and 
certain aspects of projective test performance 
which seemed relevant were studied by the cor- 
relations given in Table 4. Here are also 
given partial correlations which indicate the 
relative effects of age and total number of re- 
sponses on the Rorschach (R). Total number 
of responses on the Rorschach clearly affects 
the total number of any particular kind of 
Rorschach response, and with the range in R 
as wide as it is in this group, this must always 
be considered. The mathematical test shows no 
significant relations with any of the other 
elements studied, but it is possible that there 
is some relation between this and percentage 
of Rorschach responses using Form alone 
(F%). (Correlation between R and F% in 
this group is —.028.) The group is small inas- 


TABLE 3 
INTERCORRELATIONS OF “OUTPUT” 


S tried M tried 








“TAT length 
N r N r N r 
+ .317® 39 
Spatial, no. tried 39 
Mathematical, no. tried 
_ Rorschach, R 


*P = .05 or less. 





Verbal, no. tried +.369* 


+ .225 


+.116 
+ .095 
— 237 
+.499° 








494 


ANNE ROE 


TABLE 4 


CORRELATIONS BETWEEN NUMBER RIGHT ON VSM Susrtests AND VARIOUS Projective Test MEASURES 
PARTIAL CORRELATIONS WITH AGE AND R. 


Verbal Right (n= 59) 


Tas  #i~$&° 


9 y | 
~ “&. 


ae 2 12-age 
Rorschach 
R {. 
ITS ,007 
W + .153 + 
M +- 221 + 
F% ~ .003 + 
F+% -.137 — 
=C 
TAT 
Length —, 


219° -+-.265 
+ .018 
-146 
.204 
.009 
152 
aar” 


== 116 
— .053 


+ .322* 
+ .005 
— 1354 
— 393° 


580° 


*P — .05 or less 


much as the physicists did not take this test. 

The spatial test shows significant relations 
only with the measure of the adequacy of form 
on the Rorschach (F + %), and age and R 
are apparently irrelevant. 


The verbal test is positively correlated with 
total numbers of responses on the Rorschach 
and with number of human movement re- 
sponses (7) when the effect of total responses 
is discounted. The negative correlation with 
sum of color responses (2C) is very high 
when R is partialled out. This is a rather con- 
sistent picture — the verbally skilled person, 
productive on the Rorschach, and rather in- 
troversive. The strong interest in 
shown also by the high M is also to be noted. 
I have elsewhere speculated that the reliance 
upon verbal modes of thinking and the high 
verbal skills demonstrated by individuals who 
are much concerned with personal relations 
may be due to the fact that words are our 
chief means of interpersonal communication, 
and those to whom this is important develop 
verbal facility. The negative relation with 
sum of color responses would indicate a much 
stronger degree of introversiveness than seems 
probable in this total picture. This is very 
likely due to the manner in which XC is com- 
puted. The FC responses are very common in 
those with high M [5, and shown also in my 
data], but they are weighted only half as 
strong as the CF responses; hence, the picture 
may be distorted. Interrelations within Ror- 
schach elements themselves are not appropri- 
ately considered here, but they will be dis- 


persons 


.054 
.062 
O15 
.000 


Spatial Right (= 57) 


2. Math. Right (n — 39 


12-R = es 


12 12-age T12-age 12-R 


.048 — .030 
+.159 
+.108 
— .073 
+ .049 
+ .321° 
— 128 


.074 
213 
045 
137 
+ .234 
+ .199 
—.021 


.075 
219 
046 
140 
235 
201 
.022 


+ .040 
+ .040 
— .020 
+ .002 
+.$37° 
—.103 


.340* 
025 


+ .145 


cussed at length in another paper. 

The negative correlation between verbal 
right and TAT length is somewhat difficult to 
interpret. The correlation between 
items tried and TAT length was negligible 
although positive. Perhaps the point is that 
precision in the use of words is not related to 


verbal 


fluency in their use. 


SHADING SHOCK AND VSM SCORES 


Ihe commonly accepted interpretation of 
shading shock as a pervasive, basic anxiety 
leads to speculation as to whether the presence 
of such anxiety has a disrupting effect upon 
intellectual performances of various sorts. Ii 
so, it might be manifested in lowered scores, 
or it might be manifested in greater scatter on 
test scores. A simple test is insufficient in so 
complicated a situation, but it seemed worth 
while to examine these data in this respect. 
The results are shown in Table 5, which gives 


TABLE 5 
SHADING SHOCK ON RORSCHACH AND VSM Scores 
AND SCATTER 


Shading Shock 
Strong Mild Absent 
N=16 N=23 N- 


VSM Scores 


Verbal right, ave. 
Spatial right, ave. 
Mathematical right, ave. 
V-S, Scatter in @ scores 


53.5 
11.1 
12.9 


1.0 


58.0 
9.8 
14.0 
1.1 


60.1 
11.9 
16.8 

0.7 


mean )’, S, and M scores and mean scatter for 
the subgroups of those with strong, mild, and 
no shading shock. The measure of scatter 's 
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the difference between sigma score on verbal 


and on spatial tests. (The mathematical test 
was not taken by the physicists.) Although 
means for verbal and mathematical tests in- 
crease regularly with decreasing scatter, analy- 
sis of variance gives F’s of 1.00 and 1.20, 
which are not significant. On these data, then, 
it cannot be shown that shading shock has any 
effect on test score. 


MISCELLANEOUS COMPARISONS 


A few further comparisons, which seemed 
interest, have been made between Kor- 
schach and TAT performance. 

Length of TAT stories correlates + .555 
with number of whole responses (//) on the 
Rorschach, which drops to + .339 when R is 
partialled out, but is still significant. The 
point of this seems to be that the longer stories 
are also likely to be more complete, to encom- 
pass more, and this partakes of the same at- 
titudes which result in large production of 
whole responses on the Rorschach. 

One would also like to know whether orig- 
inality on the two tests was related. For vari- 
ous reasons, it seemed preferable to use P 

popular responses) on the Rorschach, rather 
than O (original responses). The measure on 
the TAT for each subject was the number of 
inusual stories he gave, plus one-half the 
number of stories to which he gave a distinctly 
nusual twist. In one instance, the measure 
s one of conformity, in the other, of departure 
from conformity. They should have a negative 
orrelation if they have any basis in common. 


The correlation is completely insignificant, 
+ 005. 


Finally, Piotrowski’s remark that the hu- 


man movement responses (1) “indicate the 
testee’s preoccupation with the future’’ stimu- 
lated another computation. The extent to 
which the subjects completed their ‘TAT sto- 
ries by including a future was extremely varied, 
even though they were specifically asked for 
this. ‘lo give stories with a future is not neces- 
sarily a measure of the extent to which the sub- 
ject is preoccupied with the future, but it 
would seem probable that there is some rela- 
tion. Therefore, a correlation was run be- 
tween the number of human movement re- 
sponses on the Rorschach and the number of 
stories which included the future. This is + 
.297, and, although low, is significant at the 
2 per cent level. 


SUMMARY 


This paper has reported some comparisons 
of test results on a verbal-spatial-mathematical 
test and the Rorshach and the TAT given to 
61 eminent research scientists. 
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SELF-EVALUATIVE QUESTIONNAIRES AS 
PROJECTIVE MEASURES OF PERSONALITY 


GABRIEL ELIAS 


THE UNIVERSITY 


ECAUSE of their recognized limita- 

tions, self-rating questionnaires are be- 

ing rapidly replaced by projective devices 
in individual testing. Self-rating inventories 
are prevented by their very nature from meas- 
uring those layers of the personality which the 
testee is unwilling or unable to evaluate. Un- 
fortunately, the various attempts to develop 
group projective devices [5, 8] have not pro- 
duced techniques which are sufficiently practi- 
cal to replace questionnaires for group testing 
as well. 

The success of tests such as the Rorschach 
and the TAT stems largely from the fact that 
they give holistic interpretations of the person- 
ality Gestalt rather than isolated readings for 
individual traits. 
a qualified clinician are required to interpret 
or “inspect” [10] the record, regardless of 
whether the test is given to groups or to in- 
dividuals. The resulting expense is usually 


As a result, the services of 


more then most testing programs can afford. 
Because of the factor of expense, most at- 
tempts to improve group testing techniques 
have consisted of paper-and-pencil tests which 
purport, through some indirection or ingenul- 
ty, to elicit more honest answers than are ob- 
tainable by less subtle direct questioning [2]. 
These efforts will hold little promise so long 
as they continue to interpret self-ratings as 
direct measures rather than as mirrorings of 
the 
testee and how honest his answers, the fact 


individual. No matter how sincere the 
remains that his self-evaluations must be super- 
ficial and incomplete because they must neces- 
themselves 
areas of which he is consciously aware 


sarily confine essentially to the 
— and 
these we now know are often the least signifi- 
cant ones in an individual’s personality [6]. 
Because projective devices interpret a testee’s 
answers as behavioral reflections rather than 


as self-ratings of his personality [8], they of- 
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ten tell more about the individual than he 
knows himself. Much the same occurs in in 
telligence testing, in which the individual’s test 
behavior is scored as a sample reflection of his 
Capacity. 

These considerations suggest that it may 
be possible to develop practical (easily ad- 
ministered and instruments, that 
would prove more objective and penetrating 
than the current run-of-the-mill personality in 
ventories, by scoring self-rating questionnaires 


scored ) 


as expressions rather than as ratings of person 
ality. If, as is widely believed today, the in 
dividual is a “holistic” (“biosocial” or “or 
ganismic’’) indivisible whole, we would expect 
the testee, who is rating himself, to mirror his 
personality in addition to evaluating it. 

If we apply these principles more specifically 
we can predict that the answers a testee gives 
to a true-false-? type of self-rating health 
ventory would reflect a number of different 
personality characteristics. This study in 
gated the following three predicted relat 
ships between the answers of testees on s 
nventories and their security-insecurity fe 
ings at the time: 
should 


1. Insecure testees answer 4 


i.e., complain of illness) in larger percentages 


+h 


than secure individuals. 


This hypothesis as- 
mes that, since the individual is a biosocial 


whole, “‘psychological” inadequacies such a 
security will be reflected in complaints 
physical illness. 

2. A larger percentage of insecure than 
secure testees should select ? answers. This 
hypothesis is supported partly by findings of 
other investigators that the tendency to select 
the ? answer to such questions is a consistent 
personality characteristic [3] and by clinical 
evidence which considers disproportionate 
certainty a strong indicator of personal in- 
security. 
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The more unstructured in (a) content, 
ind (6) wording an item is, the greater the 
tendency of the insecure to exceed the percent- 

of secure testees who complain of illness. 

\n unstructured question permits greater lee- 

for personality projection; therefore, it 

ild reflect personality tendencies more sen- 

The de- 

sree of structure of an item was determined 

the extent to which its content and word- 

so controlled the inquiry situation as to 

eliminate sincere (i.e., unwitting) individual 
lifferences in its interpretation. 


sitively than more structured items. 


PROCEDURE 
An 112-item questionnaire devised by A. H. 
Maslow entitled Health and Hy- 
ne Inventory” was given to several hundred 
Brooklyn College students. Several weeks 
later Maslow’s Security-Insecurity (S-1) In- 
ntory [5] was given without in any way re- 
lating the two tests.‘ For the purposes of this 
periment, the Maslow S-I Inventory was re- 


“General 


garded as a criterion of security-insecurity. 
The 122 (27 per cent of the 
up) who scored most favorably on the S-I 
re taken as a “secure” criterion group, and 
were compared with an 


individuals 


“insecure” criterion 
roup of the 122 testees who made the most 
[ favorable scores, 

The first 75 items of the health inventory 
were true-false-? questions. The remaining 37 

re similar in content to related items of the 
true-false group, but were in different form. 
which is typical of the true- 


Item number 47, 
lse questions, reads: 


“Do you consider your- 
self a rather nervous person?” The answer 
hoices are Yes, No, and ?. Item number 83, 
‘h is representative of the remaining 37 
luestions, is similar in content to item 47. It 

ids: “T have been nervous,” 
foll wing four 


and offers the 
choices, Every day, 
veral times, One or two times, Not at all. 
While this report is based on analysis of the 
results obtained for the first 75 items, the 
trends reported below appear to be confirmed 
by the results obtained for the remaining ques- 


answer 
Q 


tan 


To verify part a of the third hypothesis 


an! 


1¢ writer is indebted to A. H. Maslow for ad- 
stering these tests and making their results 
lable to the writer. 
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(which relates the structure of item content to 
insecurity) the author compared the health in- 
ventory Group F 

Group 3 ‘Table | ‘The questions of 
Group F inquire about illnesses, such as scar 
let fever and chicken pox, which we 
} 


items of with the items of 


(See 


ordinarily 
ascribe to ourselves only after a physician has 


A definite and obj 


criterion, a medical diagnosis over which the 


rendered a diagnosis ‘ctive 


testee has no control, determines whether he 
had the illnesses referred to in this group; the 
highly 


content 


content of these questions is, therefore, 
On the other hand, the 
of the questions of Group EF is 


structured. 
clearly less 
structured; they refer to conditions, such as 
colds and insufficient weight, which the in- 
dividual ordinarily diagnoses for himself with- 
out benefit of an outside criterion. 

To investigate part 4 of the third hypothesis 
(relating structure of wording to in- 
the author studied 9 pairs of health 
inventory items. They were selected because 
in the opinion of the author the questions of 
each pair resembled each other closely in clini 


item 
security ), 


cal content but differed sufficiently in wording 
structure to enable the author to designate one 
as the better structured item in each 
These pairings and structure evaluations (See 
Table 2) 


sions. 


case. 


were tested on two different occa- 
In 1946 the definition of item structure 
was typed on sheets on which the 18 questions 
were listed in order. These sheets 
were given to 18 graduate students of psy- 


random 


chology attending a seminar in attitudes con- 
ducted by H. H. Remmers at Purdue Uni- 
versity with the request |{ a) to vroun the tems 
into 9 pairs by matching each question with the 
one other which resembled it most in 
clinical content, and (4) to designate the better 


item 


structured question of each resulting pair. To 
check further on the accuracy of the writer’s 
judgments, a similar study was undertaken at 
a meeting of faculty members and graduate 
students of psychology at the University of 
Rochester. After explaining the definition of 
structure used in this study, the author read 
the 18 items, two at a time, as he had original- 
lv paired them: after each pair was read. the 
audience was requested to indicate by a show 
of hands which was the better structured item. 

The writer’s opinions were confirmed by 
both studies. Fourteen of the 18 items were 
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paired in the same way as in Table 2 by 5 or 
more of the 18 Purdue judges, and, for each 
of these 14 pairs, a majority of the judges des- 
ignated as better structured the one so desig- 
nated in the table. “he Rochester results were 
also in accord, for the majority of the group 
agreed with the author as to which item was 
better structured for 8 of the 9 pairs. 


RESULTS 


All three hypotheses were confirmed. 

The results summarized in Table 1 clearly 
confirm the first hypothesis. The 75 items of 
the health questionnaire are grouped into 8 
categories according to similarity of clinical 
content to permit quick appraisal of the re- 
sults. The critical ratios reported are for the 
differences between percentages of secure and 
of insecure who complained of illness. For 
category H, a minus sign precedes the lower 
end of the critical ratio range, because in the 
case of two items of that group the secure 
group alleged illness in greater percentages 
than the insecure. For no other item in any 
of the categories did this occur. 


TABLE 1 
IreEM CATEGORIES AND RANGES OF CRITICAL RATIOS 


Clit | Content 


CR Range 


“Psychological” Moods, e.g., “Do 


you often feel just miserable?” 20.00 to 6.3 


Tiredness, Sleep, and Rest, e.g., 
“Do you almost always sleep 
well ?” 10.80 to 5 

Fear, e.g., ‘ you ever been 

afraid of going insane ?” 8.02 to 

Evaluation of Own “Organic” 
Well-Being, e.g., “Do you 
usually feel well and strong ?”’ 

Illnesses and Ailments Which Are 
Usually Self-diag 


10.00 to 


nosed, e.g., “Do 
7.19 to2 


you have colds frequently ?” 


Organic IInesses, the Existence 
of Which Are Customarily Con- 
firmed by Medical Diagnoses, e.g., 
“Did you ever have scarlet 
fever?” 

Autonomic Rhythms, e.g., ““Does 
your stomach get upset easily ?” 6.52 to 1.67 

Overt Symptom Behavior, e.g., 
“Did you ever have con- 


3.25 to 0.00 


vulsions ? 1.00 to —0.50 


*For all except two items of category H, the insecur 
answered ‘“‘yes” (i.e., alleged illness) in as large or 
larger percentages than the secure. 


GABRIEL 


ELIAS 


For 71 of the 75 items the insecure professed 
illness in larger percentages than did the 
Of these 71 items, 49 produced criti 
cal ratios of 3 or more; the ratios for 9 other 


items were 2 or more. 


secure, 


Furthermore, of the 4 
items that did not follow the trend of the 71, 
2 were answered in identical proportions by 
both the secure and the insecure; and the other 
2 (the 2 of Group H in which the insecure 
alleged illness in smaller percentages than the 
secure) produced critical ratios of less than 1 

The third hypothesis was supported with 
regard to both item content and wording 
structure. As appears in Table 1, the items of 

up E (whose content was poorly struc 

TABLE 2 


BETWEEN ITEMS OF SIMILAR CONTENT’ 


RUT DIFFERENT IN STRUCTURE 


Poorly 
Item 
Statement 

»f Item 


Structured Better Structured Item 


otatement 
of Item 


47 Do 


yourself a 


you sit still 
without 


you consider ‘an 


rather nerv- fidgeting ? 
oO is person? . 
62 Do you usually 
70 Do 


feel well rested 


you usually feel fatigued 
when you wake 
when vou wake in the morn- 


10.80 ing? 


in the morning? 


12 Do you have 
c Ids fre- 


quently ?” 


26 Doc lds some 
times lay you out 


for a long time? 


1 Do you usually 5 Do you have any 
feel well and outstanding 
trong? physical defects? 
Have you ever | 72 Do things some- 
had spells of 


dizziness? 


times swim or get 
misty before 


your eyes? 

20 Do you get head- 
aches fre- | 67 Do you havea 
quently? great many bad 
headaches ? 
21 Do you get nau- 


seous easily? 22 Do you faint 


easily? 
74 Do your eyes often 
pain you? | 16 Can you see well 


without glasses? 
23 Do your bowels 


usually function 


well? 


| 60 Are you troubled 
4.52) 


much by 
constipation ? 








SELF-EVALUATIVE QUESTIONNAIRES AS PROJECT E MI 


tured) differentiated much more clearly be- 
tween the secure and the insecure than did the 
better structured The 
range of ratios for Group E was 7.19 to 2.00, 


ones of 


Group F. 


> 


ind for group F, 3.25 to 0.00. The results 
yrted in Table 2 show that the same tend- 
vy holds for differences in wording structure. 
For each of the 9 pairs, the question which the 
thor (and both the Purdue and Rochester 
dges) judged to be poorly structured, dif- 
ferentiated more sensitively between the secure 
1 the insecure than did the better stuctured 
Apparently, the less structured the con- 
tent or wording of the questions, the more did 
testees’ answers express differences in security- 
insecurity feelings. 


DISCUSSION 


Practical 


etail 


considerations 
The writer’s 
ial concern was a very practical one: Why, 
iew of the recognized inadequacies of per- 
sonality they remained so 
group testing? Why have 
failed to develop more acceptable tools? 
The answer appears to lie in the lag be- 
tween our theoretical postulates and their ap- 
ition. 


theoretical 
present 


and 


in the study. 


inventories, have 


nopular for we 


Practically every psychologist and 
hiatrist, be he a behaviorist or field theorist, 


Freudian or an Adlerian, accepts the “‘hol- 
” “organismic,” 
The 


le. He expresses his personality in his be- 


or “‘biosocial”’ concept of 
mality. individual behaves as a 
Yet, rating questionnaires must, per- 
force, violate this principle. When, as in 
pinion polling, we seek the most objective an- 
possible, we usually try to word our ques- 
ns and structure our test situations so as to 
ninate all room for projection of personality 
We hope thus to avoid subjectivity and 
relevant personality reflections. But is this 
ible? We know that testees lie, repress, 
nd distort when rating themselves. A self- 
rating, when it is valid, is accurate only be- 
ise the testee’s entire personality is such that 
can properly evaluate himself under the 
rcumstances. Because a self-rating, accurate 
' inaccurate, is behavior and an expression of 
he entire individual, its very accuracy or in- 
uracy will depend on his personality. 
The findings of the present study indicate 
that even when answering a simple question 


+ 


ASURES OF PERSONALITY 


health, 
feelings of security or 
that 
instances in 


— = 
regarding his an individual projects 


They show, 


ince it 
nsecu®lity. 


moreover, personality projects itself in 


such fashion, and, 


predictable 
gained by con 
hnigq 1¢s 


therefore, that much can be 


structing paper-and-pencil te which 


} 


apply rather than fl holistic theoretical 


aunt 
considerations. We would do better to design 
tests so as to predict personality from projec 
tive behavior than to seek to devitalize a test 
in a fruitless attempt to prevent it from elicit 
ing personality projection. 

The writer has already completed one rath 
er extensive study based on these principles [5] 
in his construction of the Elias Family Opin- 
ion Survey. This test measures personal feel- 
projectively. A 
testee is asked to express his opinions regarding 
“the general family life that existed in your 
community when you were a child.” A typical 
question is 60, which reads, “Children love one 


ings of homey-homelessness 


of their parents in ways that aren’t nice.” In 
the process of expressing his attitudes regard- 
ing general family life, he projects his own 
feelings, and these reflections are as objectively 
and easily scored as are the self-ratings of typi- 
cal attitude and personality inventories. Vari 
ous studies with the test prove it to be re- 
liable and valid [5, 12]. 

In another application of this study’s find- 
ings, one of the author’s students gave the 
adult form of the California Test of Person 
ality and 32 selected items of Maslow’s Gen 
Health and Inventory to 82 
University of Arkansas psychology students. 
He found that a 
centage of those whose scores fell into the most 
maladjusted 


eral Hy, giene 


significantly larger pet 
California 
to the health in 
ventory than did those who scored in the most 
adjusted quarter (P [y?] < .05). This finding 
can be extended to attitude testing and any 
other area where T-F-? questions are used. 
Are the results of these tests somewhat dis 
torted by the 


stronger insecure in- 


quarter on the 


test 


gave tour or more ? answer 


factors 
tendency of the 
dividuals to select the ? answer to questions 
irrespective of their content? 


personality such as 


more 


SUMMARY AND CONCLUSIONS 
Despite the admitted inadequacies of self- 
rating questionnaires, they are by far the most 
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A PSYCHOMETRIC STUDY OF DRINKERS' 


PETER J. HAMPTON. 


WESTERN RESERVE UNIVERSITY 


HE basic purpose of this study was (1) 

to compare a group of alcoholics with a 
group of comparable nonalcoholics on a 
mber of tests; (2) to select, on the basis of 
the comparison, those test items with respect 
to which statistically significant differences be- 
alcoholics and nonalcoholics were 
found; (3) to construct from these items a 
eliable and valid personality questionnaire 
for drinkers which might differentiate not only 
oholics from nonalcoholics, but among dif- 
rent categories of alcoholics and nonalcohol- 


tween 


PROCEDURE AND RESULTS 


A battery of tests consisting of a Personal 
story Questionnaire, the Otis Self-Adminis- 
ring Tests of Mental Ability, the Strong 
ational Interest Blank for Men, and the 
linnesota Multiphasic Personality Inventory 
s administered to a group of 84 male alco- 
lics and a comparable group of 84 male non- 
holics. Criteria used for inclusion of sub- 
ects in the experimental group of the test try- 
t part of the study were as follows: (1) 
membership in Alcoholics Anonymous, (2) 
nouncement by one or more psychiatrists 

t the subject was an alcoholic, and (3) 
pitalization or commitment to jail for al- 
holism. The criteria used for inclusion of 
bjects in the control group of the test try- 
t part of the study were: (1) inapplicability 
ontrol subjects of the three criteria used 
tor the selection of experimental subjects, (2) 
ubject’s own statement that he was not an al- 
‘cholic, and (3) close approximation in age, in- 
telligence, education, and socioeconomic level 


‘From a thesis submitted to the Graduate Faculty 
‘ Western Reserve University in partial fulfillment 
' the requirements for the degree of Doctor of 
Philosophy. This report was read at the Annual 
Meeting of the Canadian Psychological Association, 
Toronto, May 1950. 


of control subjects to experimental subjects 

Test results for alcoholics and for non 
alcoholics were reported and expressed in 
means, standard deviations, and f-ratios. ‘Tests 
revealing statistically significant differences in 
mean scores between alcoholics and nonalco- 
holics were submitted to further study. An 
item analysis of the Minnesota Multiphasic 
Personality Inventory and parts of the Person- 
al History Questionnaire revealed a fund of 
items with respect to which statistically signifi- 
cant differences were found between alcoholics 
and nonalcoholics. For most of the responses 
to these items the differences between alco- 
holics and nonalcoholics were of a magnitude 
that would occur less than once in 100 times 
as the result of sampling errors; in nearly all 
of them, the differences would occur no more 
than 5 times in 100 as a result of sampling 
errors. 

Discarding all the items with regard to 
which a critical ratio of less than 2.00 was ob- 
tained left 125 
Multiphasic Personality 
items from the Personal 


items from the Minnesota 
Inventory and 31 
History Question- 
naire. Responses to these items were weighted 
in accordance with Strong’s Tables of Net 
Weights for Differences in Per Cents, and the 
items were included in a trial Personality 
Questionnaire for Drinkers. Part 1 of the 
study thus resulted in the development of a 
trial questionnaire which differentiated alco- 


holics from nonalcoholics. The next step was 
to find out whether this trial questionnaire 
would differentiate other alcoholic and non- 
alcoholic groups. 


In part 2 of the study a new sample of al- 
coholics and nonalcoholics was used. The trial 
Personality Questionnaire for Drinkers was 
administered to 250 male subjects. Altogether, 
100 male alcoholics, separated into groups of 
50 secondary alcoholics and 50 week-end alco- 


501 





502 PETER J. HAMPTON 


holics, were compared with 150 comparable 
male nonalcoholics, separated into groups of 
50 social family drinkers, and 50 social occa- 
sion drinkers, and 50 near or total abstainers. 
Inclusion in the various groups was by self- 
rating. 

Test results for alcoholics and nonalcohol- 
ics, and for each subgroup, were again record- 
ed and expressed in means, standard devia- 
tions, and t-ratios. Significant differences 
were found not only between the mean scores 
of the alcoholic and nonalcoholic groups, but 
also between secondary alcoholics and week- 
end alcoholics, social family drinkers, social 
occasion drinkers, and near or total abstainers ; 
between week-end alcoholics and social family 
drinkers, social occasion drinkers, and near or 
total abstainers; and between social family 
drinkers and social occasion drinkers and near 
or total abstainers. 

Test norms, validity, and reliability data for 
the trial Personality Questionnaire for Drink 
ers were obtained from the check sample pop- 
ulation. The validity of the trial Personality 
Questionnaire for Drinkers was investigated 
by the extreme-groups method and the expect- 
ancy table technique; the reliability of the trial 
Personality Questionnaire for Drinkers was 
checked by the split-half method and the test 
retest method. Both the validity and reliabili 
ty (test-retest coefficient of reliability 87 
corrected split-half coefficient of reliability 
89) of the test were found to be acceptable. 
Finally, a tentative cutting score of 59 on the 
Questionnaire, dividing alcoholics or potential 
alcoholics from nonalcoholics, was arrived at 


by inspection of ogive curves, 


CONCLUSIONS AND IMPLICATIONS 


It has been demonstrated in this study that 
significant differences in responses to items 
from a Personal History Questionnaire and 
the Minnesota Multiphasic Personality In 
ventory exist between alcoholics and nonalco 
holics of similar backgrounds. This finding 
supports the hypothesis that alcoholics can be 
adequately differentiated from nonalcoholics 
on the basis of differences in responses to cer 
tain items on the Personal Historv Question 
naire and the Minnesota Multiphasic Person- 
ality Inventory. 


It has further been shown that the t 
Personality Questionnaire for Drinkers, con- 
structed with items from the Personal History 
Questionnaire and the Minnesota Multiphas 
Personality Inventory with respect to w 
statistically significant differences between al- 
coholics and nonalcoholics were found, whe: 
administered to new samples of alcoholics and 
nonalcoholics, differentiates not only alcohol 
from nonalcoholics, but different categories « 
alcoholics and different categories of nonalc: 
holics from each other. In particular the Pe 
sonality Questionnaire for Drinkers has 
shown to differentiate (a) alcoholics 
nonalcoholics, (4) secondary alcoholics from 
week-end alcoholics, social family drinkers 
social occasion drinkers, and near or total ab 
stainers, (c) week-end alcoholics from social 
family drinkers, social occasion drinkers 
near or total abstainers, and (d) social fan 
drinkers from social occasion drinkers and near 
or total abstainers. 

The Personality Questionnaire for Drinkers 
is believed to have sufficient validity and 
liability to be of value as a screening techn 
for drinkers. It may also be of value 
ruide in vocational guidance, employee sele 
tion, and clinical diagnosis. 


Results of this study show that mean 


for different groups of drinkers on the Per 
hits : 


Questionnaire for Drinkers rise in n 
ide as alcohol increases in importan 
the life pattern of the variously class 
rroups. The more maladjusted the indiy 
s on the average, the more need he seems t 
show for alcohol as a crutch. The results 
far as they are representative of other po; 
lation samples, therefore may indicate that 
dividuals who show high scores on the Per- 
onality Questionnaire for Drinkers, 
tive of present status of drinking, m 
pt to use alcohol as an escape under extrem 
motional stress. The Personality Question 
naire for Drinkers will, of course, have to | 
ed with caution until further research cor- 


lings. 


borates our fin 
Results of this study also suggest that 
vereas near or total abstainers find little or 
no use for alcohol in their life adjustments 
they show, on the average, fewer favorable re 
ponses on the Personality Questionnaire for 
Drinkers than do social drinkers. On first in 
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pression this suggestion is startling. Upon ex- 
amination, however, the facts are not so dis- 
parate after all. Near abstainers and especial- 
total abstainers may also depend upon the 
ncept of “drinking” as a crutch — but in a 
lifferent way. Drinkers seem to depend upon 
lrinking largely as a defense mechanism; ab- 
stainers may depend upon not-drinking as a 
Jefense mechanism. This explanation is sug- 
gested on the basis of responses made by tctal 
abstainers to item number 97 on the Person- 
ality Questionnaire for Drinkers, which reads, 
I believe that a person should never taste an 
alcoholic drink.” The majority of total ab- 
stainers answered this statement by encircling 
True.” In addition, a number of total ab- 
stainers amplified their feelings about drinking 
by such write-ins as, “I have never touched 
or and I hope I never will!” and “I would 
rather be dead than touch the stuff!” 
Finally, no uniform personality pattern was 
nd on the Personality Questionnaire for 
Drinkers which was characteristic of all in- 
nperate drinkers. There were several pat- 
terns, however, which of their in- 
lence may be presumed to characterize large 
mbers of the statistical population of alco- 
Analysis of 


because 


Personality 
Questionnaire for Drinkers indicative of dis- 


items on the 


bling psychological abnormality suggests the 

resence of the following clinically defined 
nstellations in the personalities of alcoholics: 
order of occurrence they are: neurotic, psy 

pathic, compulsive, hypomanic, schizothy 
and homoerotic. 


SUGGESTIONS FOR FURTHER RESEARCH 


The statistical treatment of data and the 
nclusions discussed in this study should not 
onsidered final statements. The construc- 
n of screening techniques for psychometric- 
differentiating primary alcoholics from 
secondary alcoholics, secondary alcoholics from 
week-end alcoholics, week-end alcoholics from 
social family drinkers, social family drinkers 
‘rom social occasion drinkers, and all of these 
‘rom total abstainers, is a recent development. 
Extensive research will be necessary if such 
«reening techniques as the Personality Ques- 
tionnaire for Drinkers are to be used wisely in 
lagnosis, prediction, and control of “drink- 
ng” behavior. The following suggestions are 


h cor: 
t that 
ttle or 
ments 
ble re 
ire for 


rst in 


503 


made to encourage further research on screen- 
ing techniques for drinkers. 
1. Search for signi/ While the 


present study has located a substantial number 


cant items. 


of test items which significantly differentiate 
different from each 
other, the search for such items shou!d be con- 


categories of drinkers 


1 ; 


tinued both by way of try-out of already ex 


isting test items and by construction of new 
items. 
2. Need for validation studies. The 


quacy of differentiating 
drinkers established by 


ade 
items for difterent 
way of 
critical ratio procedures should be checked by 
more refined statistical techniques. 
need for correlational 
variance and 


categories of 


There is 
Analysis of 
analysis would be of 
advantage. ‘There is need, also, for the estab- 
lishment of occupational norms and critical 


studies. 
factorial 


scores for different groups of drinkers. 
different 


study has 


3. Personality characteristics of 
The 


constellations of p 


types of drinkers. present 


isolated several 


among 


-rsonality 
intemperate drinkers. Correlationa! 
studies determining the degree of relationship 
between responses to items of the Personality 
Questionnaire for Drinkers indicative of dif- 
ferent personality patterns, and responses to 
items of different tests purporting to meas- 
ure these patterns, would provide a clearer 
picture of the amount of relationship between 
these patterns in the personality of different 
types of drinkers. 
4. Stability of 


.- gen 
of differ 


personality characte 
nt types of drinkers. 


personality 


risti c 
In addition to 
locating constellations among 
drinkers there is also need for stu lving their 
stability. idinal 


basis by retesting periodically to identify re 


This can be done on a longitt 
curring patterns or shifts to different patterns, 
or on a cross-sectional basis by comparing test 
results for different age levels. 

5. Moderate drinking and personality ad 
justment. Since extent and intensity of in 
temperate drinking seem to be related to per 
sonality maladjustment, it may well be that 
moderate drinking is related to personality ad 
justment. 
gated. 

6. Personality constellations of abstainers. 


This possibility should ‘be investi- 


An adequate study of “pure” abstainers should 
be made to see to what extent the compulsive 
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nature of their personality, suggested by this 
study, is related to intemperate drinking, 
moderate drinking, and no drinking. 

7. Occupational rating scale for drinkers. 
It might also be of importance to vocational 
psychologists to have competent judges, who 
are familiar with the duties of a wide range 
of jobs, rate a list of representative occupations 
on a five-point scale for extent of drinking. 


Received February 16, 1951 


REFERENCES 


HAMPTON, Peter J. The development of a per- 
sonality questionnaire for drinkers. Unpublished 
Doctor’s thesis, Western Reserve Univ., 1950. 


Manson, M. P. A psychometric analysis of th, 
psychoneurotic and psychosomatic characteristics 
of alcoholics. J. clin. Psychol., 1949, 5, 77-83 
Manson, M. P. A psychometric anaylsis of psy 
chopathic characteristics of alcoholics. J. consult 
Psychol., 1949, 13, 111-118. 

Sevicer, R. V. Are you an alcoholic? Alcoho! 
Hyg., 1946, 2, 5-10. 

Tiesout, H. M. The syndrome of alcohol 
diction. Quart. J. Stud. Alcohol, 1945, 5, 535- 
546. 

WITTMAN, Puy.tuis. A controlled study of the 
development and personality characteristics of 
chronic alcoholics. Elgin Papers, 1939, 3, 77-84 
Wortis, H., Stttman, L. R., AND HALPERN 
FLORENCE. Studies of compulsive drinkers. New 
Haven: Hillhouse Press, 1946. 





PRACTICAL CORRELATES OF PSYCHIATRIC 
SYMPTOMS 


J. R. WITTENBORN' 
YALE UNIVERSITY 
AND 
FRED A. METTLER 


COLUMBIA UNIVERSITY 


HERE is evidence that the symptoms 
manifested by mental hospital patients 
cluster together in a systematic fashion 
2, 3]. The nature of these clusters has been 
fetermined through factor analyses of symp- 
tom rating scales. The rating scales were de- 
eloped in collaboration with psychiatrists and 
sresumably are a good and representative 
mple of the types of currently discernible 
mptoms which psychiatrists consider impor- 
tant in making a descriptive diagnosis of men- 
hospital patients whose disorder could be 
scribed as functional. They are primarily 
riptive in nature and make no particular 
nand upon the theoretical sophistication of 
he rater or his skill in making psychoanalytic 
nferences. Although the scales should not be 
nsidered representative of all the various 
ls of psychiatric disorders which may be 
‘ountered, the nature of the symptom clus- 
rs which are found among these rating scales 
‘plausible from the standpoint of the descrip- 
tive concepts employed in the diagnosis of psy- 
hiatric patients. 


The factor analysis studies provide the 
basis for a quantitative weighting of groups of 
symptom rating scales so that it is now pos- 
ible to rate the symptom manifestations of a 
mental hospital patient and to score these 
vmptom ratings with respect to each of nine 
lifferent symptom clusters; i.e., it is possible 


‘This investigation as supported by a research 
grant (MH 118C, Dr. Fred A. Mettler responsible 
nvestigator) from the National Institute of Mental 
Health, of the National Institutes of Mental Health, 
Public Health Service. The writers also acknow!- 
tdge the cooperation of the State of New Jersey 
ind the staff at Greystone Park. 


to describe a given patient quantitatively with 
respect to nine different empirically determined 
symptom clusters. Tentative labels have been 
offered for these symptom clusters, and the 
general nature of each symptom cluster may 
be determined by referral to the original 
studies [1, 2, 3], which show the precise de- 
gree to which each symptom rating scale 
participates in the determination of each of the 
various clusters. 

Obviously, there are many aspects of the 
patient’s existence which are a part of, or are 
related to, the general nature of his disorder 
but which are not likely to be included in psy- 
chiatric symptom rating scales, particularly in 
rating scales which are restricted to currently 
discernible symptom manifestations. Patients’ 
scores on the various symptom clusters have 
been correlated with scores on the Wechsier- 
Bellevue subtests [4] 
Rorschach scores [5]. These correlations con- 
tribute little to the 
general implications of the symptom clusters 
and, 


and also with various 


our understanding of 
incidentally, contribute less to our con- 
fidence concerning the possibility of using 
scores from either of those tests as an aid in a 
descriptive diagnosis. 

In a recent study [6], the symptom cluster 
scores have been correlated with biographical 
factors found in the patient’s hospital records. 
This study shows that among hospitalized pa- 
tients there are significant relationships be- 
tween the patient’s score on the symptom 
clusters and characteristics of the onset of the 
patient’s disease. Similarly, biographical situa- 
tions or characteristics which existed before 
the patient was recognized as ill were shown 
to be related to current symptom clusters ob- 
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TABLE 1 
Cur SQUARES FOR TESTS OF INDEPENDENCE BETWEEN OCCUPATIONAL SCALE 
AND SYMPTOM CLUSTER SCORE 


Scale 


irenia 


Schizophrenic 
Excitement 
Acute 
Anxiety 
Manic-Depressive, 
Depressed 
Paranoid 
Schizop 
Paranoid 
Condition 
Hysteria 
Phobic- 
Compulsive 








Incapable of carrying out tasks. 
Extremely disagreeable 

Takes pride in quality of work................. 
Not reassured of doing worth-while work 
Acutely uncomfortable 

Interest aroused with difficulty 

Difficult to draw interest to a task 
Insistently argumentative 

Insists on continuous reassurance 
Spontaneously assaultive.. 

Ideas consistently bizarre 

Consistently careless worker 
Characteristically hilarious 

Oblivious to demands of task 

Unrealistically overconfident iene 
Fails to comprehend requirements of situation 
Grossly inconsiderate................. 


= 
oo 


Habitually hypercritical 


_ 
© 


Despondent, dejected, retarded 


ht 


Oblivious of feelings of others 


i 
—_ 


Unable to attend to activities 


2 
~ 


Difhculty understanding problems 
Feels people are against him 


Openly hostile and destructive 


a ne 


Beliefs unrealistic and delusiona 


OA 


Behavior hurried, impulsive 
Completely lacking in initiative 


io 9) 


Inappropriate speech and actions 


oO 


Violates rules and regulations 


YCNNNHONHNHNE 
= ~~ 


Serious lack of drive 


w 
~ 


Manneristic behavior 

Complete lack of persistence 

33. Generally unkempt ae 

34. Expends excessive effort needlessly 

35. Fastidious concern for next phase of work 
36. Lacks stamina...... Se 

37. Cannot concentrate = 

38. Reluctant when induced to begin work 
39. Verbalizations characterized by profanity 
40. Gross lack of dexterity 

41. Reacts strongly to criticism 

42. Demoralized by failure 

43. Resentful of task . 

44. Continuous speech or actions... 

45. Responds only when approached or detained 
46. Markedly overmeticulous and precise 
47. Makes critical and unkind remarks 

48. Lacks awareness of errors........... 


49. Selfish and uncooperative with others_ 
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lable 1 


Interests changeable 

Laughs inappropriately 

Speech usually terse 

Works at extremely slow p: 

Extremely submissive 

Suggestions fantastically inappropriate 
Exaggerated suspicions of others 
Efforts characterized by sporadic fluctuations 
Unconcerned about his behavior 
Negativistic and contrary 

Refuses to participate in activities 


Believes activity aggravates symptoms 


*Negative gn for chi sqvare 
ned from ratings made at the time the pa- 
ts were admitted to the hospital. The 

ot correspondence observed in this study 
of interest not only because they 


the general behavioral signifi- 


broaden 
oncept of 
of various types of symptom manifesta- 
ns, but also bec: i 
- - 


use they encourage us in 
of the biographical origins of mental 


[he symptom cluster scores have been cor- 
-d with the diagnosis actually given the 
itrists [7]. This investiga- 
le 
i 


b psychi 
of considerable interest because it has 
vn that the labels given symptom cluster 
‘S are appropriate in the sense that the 

s symptom cluster scores are most hith- 
orrelated with the psychiatric diagnosis 

customarily bears the label assigned to 
symptom cluster score. 


THE PRESENT ANALYSIS 


order to extend our knowledge of the 
ral behavioral significance of symptom 
ter scores, socially relevant aspects of the 
tient’s behavior should also be studied. Ac- 
rdingly, the present study is designed to re- 
e the kinds of socially practical behavior ob- 
tved in occupational therapy situations (and 
¢ other special services of the hospital) with 

symptomatic behavior represented by the 
mptom cluster scores. The present report 
cribes a statistical investigation in which 


(continued 


SYMPTOMS 


indicates an inverse relation 


each of 61 rating scales appropriate for oc- 


cupational therapy situations is correlated with 
The 

yup of 60 schizophrenics who 
were hospitalized at 


each of the nine symptom cluster scores.” 
patients are a gre 
Park, 

Jersey, for a total of at least three years, 


who were considered as possible candidates for 


New 


and 


Greystone 


psychosurgery. The set of occupational rating 
scales was developed by the staff of the North- 
ampton Veterans Administration Hospital in 
collaboration with the writer. 


from this rating scale would 


A typical item 
read in the fol 
lowing manner: 


1. Shows average or better ability to understand 
directions. 
2. Understands adequately 


repeated instructions. 


with examples and 


3. Capable of understanding only the routine 
aspects of a problem. 

4: Has extreme difficulty in 
most routine aspects of 


peated directions. 


understanding the 


a problem even after re 


In all cases high ratings for the occupational 
extreme of the behavior 
by the scale. Similarly, 
tom cluster scores indicate 


scales indicate an 
sampled high symp 
a high degree of 
the type of symptoms indicated by the cluster. 

The results of the analysis are shown in 


2The occupational ratings were made by occupa 
tional therapists, attendants, or nonmedical 
supervisors. The psychiatric symptom ratings were 
made independently by psychiatrists. 


other 
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Table 1, which includes a terse phrase from 
the most extreme statement of each of the 
rating scales. Table 1 shows ,? 
which are based on fourfold tables and which 
relate the patient’s status on each of the rating 
scales to his status on each of the symptom 
cluster scores. Only the 7? statistics sufficient 
ly large to be significant at the 5 per cent 
level are included in Table 1. Of the 549 pos- 
sible relations, 45 of them are significant at 
the 5 per cent level. This is approximately 
twice as many as would be expected if chance 


statist ics 


alone were determining the size of the y? sta 
tistic. 
to broaden our understanding of the behavioral 
significance of the various symptom cluster 
scores, a discussion of the results of Table 1 


Inasmuch as the analysis was conducted 


will proceed from symptom cluster to symp 
tom cluster. 

The first column in the table is for 
manic-state cluster score. There is a tendency 
for high scores on this cluster to be associated 
with item 17, grossly 
hurried, impulsive behavior ; 28, inappropriate 
speech and actions; 37, cannot concentrate; 
44, spontaneous and continuous speech or ac- 
and 49, 


the 


inconsiderate; 26, 


tion; 48, lack of awareness of errors; 
selfish and uncooperative with others. 
It is interesting to observe that the excited 
cluster score is quite different from the manic 
Actually, the two cluster scores 
have but one scale in common, number 48- 


cluster score. 


both the excited and the manic patients seem 
unaware of their errors. Patients who tend to 
have a high score on the excited cluster are 
characterized in this way: 1, incapable of car- 
rving out tasks: 6, interest is aroused with 
difficulty ; 14, are oblivious to the demands of 
a task; 
others; 22, have difficulty understanding prob- 
lems; 29, violates rules and regulations; 40, 
there is a lack of dexterity; 48, are unaware 
of their errors. 

The patients who stand high on the anxiety 
cluster score tend to have the following kinds 
of scale ratings: 1, they are not incapable of 
carrying out assigned tasks; 3, they take pride 
in the quality of their work; 12, they are not 
consistently careless workers; 14, they are not 
oblivious of the demands of a task; 16, they 
are not the ones who fail to comprehend the 
requirements of a situation; 21, they are not 


20, are oblivious to the feelings of 
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the ones unable to attend to activities; 2? 
they are not the ones who have difficulty 
understanding a problem; 28, they are not 
characterized by speech and actions inappropri- 
ate to the situation; 55, they do not 
fantastically inappropriate suggestions; and 
57, their efforts are not characterized by s 
radic fluctuations. In general, it could be in- 
ferred that in this sample of schizophrenic pa- 
tients, those who had the highest ratings on 
anxiety were the patients who in general were 
free from many kinds of occupational disabili 
ties. The patients who stand high on the ar 
ety cluster do not tend to be rated in a manr 
similar to the patients who stand high on the 
manic cluster or on the excited cluster. As 
matter of fact, the patients with high anxiety 
scores and high excitement scores tend to he 
‘haracterized in opposite respects by two oc- 
upational rating scales. 

Patients who stand high on the depressed 


cluster score tend 
fo] 


give 


to be characterized in the 
lowing respects: 8, they are not insisten 


' 


irgumentative; 15, they are not unrealisti 


overconfident; 20, they are oblivious of ¢! 
feelings of others; 22, they do have diffi 
understanding problems; 23, they don’t 


to feel that people are against them; 3 


seem not especially concerned about the : 
phase of a task; 40, there is a lack of dext 
41, they do not react strongly to critici 
they do not make critical remarks 
others; 52, their speech is terse; 61, the 
not believe that activity is going to ager 
their symptoms. 


It is most interesting to observe that the 


symptoms of the paranoid schizophren 


measured by the symptom cluster score b 

no relation to any of the 61 items sampled ! 

the occupational rating scales. In view of t! 

numerous plausible respects in which hig! 
scores on the other symptom clusters are ' 

lated to various aspects of the occupational 
rating scales, this is a conspicuous finding 
This suggests that for the most part, the dis- 
turbance of the paranoid schizophrenic is pr 
marily related to the symptoms which are of 
interest to the psychiatrist, and that the dis- 
turbance of paranoid schizophrenia does not 
detract to any particular degree from the pa- 
tient’s performance on the kinds of occupation- 
al activities in which institutional patients en- 
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gage. It is commonly reported by institutional 
workers that the paranoid patients are among 
the most useful around the hospital. It is 
srobably because of this fact that many para- 

id patients are released as competent when 
a effect they are still seriously ill and although 
possibly occupationally competent, grossly in- 
ompetent from the standpoint of social and 
terpersonal existence. 

The lack of relation between occupational 
ratings and scores on both the paranoid schizo- 
shrenic cluster and the paranoid-condition 

ister is quite striking and suggests that the 
paranoid patient, whether he be considered 
chizophrenic or not, is generally a patient 
who adapts reasonably well to the require- 

its of simple occupational activities.* 

The paranoid-condition cluster score is not 
pecially characteristic of the present sample 

1 is related to only one rating scale, and 
this one has more social significance than sig- 
nificance for competence in a simple occupa- 
ton. For scale number 47, it is shown that 
these patients tend to make critical and un- 

lly remarks about others. 


TABLE 2 
IBUTION OF SYMPTOM CLUSTER SCORES FOR THE 
SAMPLE OF 60 SCHIZOPHRENICS 


Scores 
Scores 
7 and 8 
Scores 
9 and 10 


‘a 


| 
| 
| 


nN 


phrenic 


tement 


ressed 
ranoid 
hizophrenia 
iranoid 
ndition 
ersion 
steria 
Ic- 
mpulsive 
ebephrenic 
hizophrenia 14 





Scores on the conversion hysteria cluster 
uso are seen to bear no relation to occupational 


his interpretation is confirmed by a scrutiny of 
lividual ratings. 
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aspects of the patient’s behavior. This, how- 
ever, merely reflects the fact that almost no 
patients in the present sample show strong con- 
version tendencies. 

It is noted that patients with phobic-com 
pulsive rare in the 
present sample and are not particularly char- 


symptoms are relatively 
acterized by any of. the occupational scales. It 
seems plausible that patients with phobic-com- 
pulsive symptoms, who are placed in any toler- 
able activity, tend to without 
gross handicap. There is one respect in which 
the phobic-compulsive patient is characterized 
by the occupational scales. Specifically, item 35 
shows him to be fastidiously concerned about 


would work 


the next phase of his work. 

Patients who have high scores on the hebe 
phrenic or deteriorated type of schizophrenia 
are characterized in certain respects by their 
occupational ratings. For example, 20, these 
patients are oblivious of the feelings of others; 
22, they have difficulty understanding prob- 
lems; 23, they are not inclined to feel others 
are against them; 25, they do not have grandi- 
ose or grossly unreasonable beliefs in their 
ability ; 29, they consistently violate rules and 
regulations; and 52, their speech is usually 
terse. The patients with the high scores on the 
hebephrenic-type cluster are seen to resemble 
patients who have high scores on some of the 


other symptom clusters. 
| > 


For example, items 
, being oblivious of the feelings of 
others and having difficulty in understanding 
problems, are characteristic of patients with 
high scores on the excited cluster, those who 


20 anc 


have high scores on the depressed cluster, and 


those with high scores on the hebephrenic 


cluster. The excited and the hebephrenic pa- 
tients are similar with respect to another scale, 
number 29 — they both tend to violate rules 
and regulations. It is not surprising that 
there should be some similarity between the 
excited and the hebephrenic type of schizo- 
phrenic clusters because the scales contain cer- 
tain items in common. As a matter of fact, 
in a newly admitted sample of patients in 
whom the deteriorated or hebephrenic schizo- 
phrenic symptoms are not yet manifest, it is 
probable that the excited and the hebephrenic 
cluster scores would show the same pattern of 
relationship. The hebephrenic schizophrenic 


cluster score is also correlated with some of 
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the scores which are significantly correlated 
with the depressed cluster score. ‘hey are: 
number 23, do not tend to feel that people are 
against them, and number 25, do not have an 
unrealistic delusional or grandiose notion con- 


cerning their role or status. 


SUMMARY AND CONCLUSIONS 

The occupational and practical significance 
of the empirically determined symptom clusters 
are studied by correlating each of nine symp- 
tom cluster scores with each of 61 occupational 
rating scales. The sample comprises a group 
of 60 schizophrenic patients at the Greystone 
Hospital, Greystone Park, New Jersey. With 
the limitations which this sample imposes the 
following conclusions are offered: 

1. The occupational ratings for patients 
who have high scores on the manic cluster, on 
the excited cluster, on the anxiety cluster, on 
the depressed cluster, and on the hebephrenic 
cluster are all conspicuously and meaningfully 
different from each other in a manner congru- 
ent with the general interpretation given these 
scores. The two cluster scores which resemble 
each other most with respect to their occupa- 
tional scale correlations are the excited and 
hebephrenic schizophrenia scores. 

? 


2. Patients having relatively high 


on the conversion hysteria cluster and on the 


scores 


phobic-compulsive cluster are not characterized 
by the ratings they receive on the occupational 
rating scales. No general significance may be 
ascribed to this finding, however, because the 
present sample comprises patients who for the 
most part have almost no conversion symptoms 
and relatively few phobic-compulsive symp- 
toms. 


R. WITTEN BORN 


AND FRED A. METTLER 


3. In the present sample, many patients 
who are quite paranoid with respect to both 
the paranoid schizophrenia cluster score 
the paranoid condition score are not characte; 
ized by any of the occupational disadvantage 
expressed by the occupational rating scales. |t 
is suggested, therefore, that the condition of 
the paranoid patient and the severity and social 


significance of his illness should not be evaly 
ated on the basis of his performance in o 


S 


pa 
tional therapy or other special services of me 

tal hospitals. His danger to himself and to 
society lies not in his occupational limitations 
but lies in the nature of his conception of his 
role relative to that of his fellow men. His 
false conceptions can be readily discerned b 
an examining psychiatrist and may be rated. 
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A NOTE ON FOSBERG’S “FOUR EXPERIMENTS 





WITH THE SZONDI TEST” 


JACOB COHEN 


NEW YORK UNIVERSITY SCHOOL OF EDUCATION AND 


N excellent series of experiments with 
the Szondi [2] unfortunately 
marred by an error in reasoning in- 
expectancy of varying 
imbers of selections in a diagnostic category 


was 
chance 


{ the Szondi. The error does not invalidate 
‘conclusions of the study, but since an in- 


reasing amount of research is being done with 


this instrument, it is perhaps worth while to 
et the record straight and thus prevent repeti- 


mn of the error. 
Fosberg evaluates the chance expectancy of 
ero through six “like” and “‘dislike’” choices in 


:given Szondi factor by means of expanding 


He reasons, quite 
rrectly, that since two “liked” photographs 
re chosen in each set, the probability of any 
tor being thus chosen is 2 in 8 or 14. He 
tates, however, that the probability of a factor 
“disliked” is 2 in 6, or 1/3, 


parently reasoning that following the selec- 


” 
’ 


ng chosen as 


only six 
remain from which to choose the two “dis- 
‘d.” This leads to the conclusion that it is 
nore probable that a given photograph be 
hosen as “disliked” (1/3) than as “liked” 
;). This seems wrong intuitively. 
If we reason somewhat differently [1], 
tamely, that irrespective of “like” or “dislike,” 
ir photographs are chosen from eight, and 
lf of the four are “liked” and half “disliked,” 
arrive at the same probability for both 
hich seems more reasonable), namely p = 


The error in [2] is this: It is true that 
hen two photographs are selected as “dis- 
ed,” only six remain (two having been pre- 
usly selected as “liked”), thus yielding the 
lue 2/6 or 1/3. However, for a photograph 
» be present among the remaining six, it must 
wt have been chosen as “liked.”” The proba- 
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4, the 
Thus, the 
simultaneous occurrence of the photograph not 


bility of the latter is 3/4 (1 minus ! 
probability of selection as “liked’’). 


being selected as “‘liked,” and being selected as 
“disliked” is 3/4 times 1/3 which equals 4, 
the same probability as 
“liked” (as in [1]). 


Fosberg’s error led to incorrect values for 


was obtained for 


the chance expectancies of zero through six 
“dislike” choices in a factor and further re- 
sulted in mistaken conclusions in chi square 
tests of the divergence of two empirically ob- 
tained distributions from these chance expect- 
ancies [2, Table 3]. 
chance distribution for “dislikes” 


The correct theoretical 
is the same 
as that for “likes,”’ correctly stated in [2, p. 
39]. When the chi square values are computed 
using the correct distribution, the following 
changes in results are found: 

1. From significantly different from 
chance to not significant: s— and e— in the 
normal group, and p 
atric group. 


2. From not significantly 


and d— in the psychi- 
different from 
chance to significant: hy— and 4— in the psy- 
chiatric group. 

The present author would like to stress the 
fact that these corrections in no way vitiate 
Fosberg’s conclusions and knows of no objec- 
tive experimental evidence which can challenge 
his statement that, “It may well be that there 
are psychological forces at work in determining 
the selection of pictures, but they are probably 
not what Szondi workers believe them to be” 
[2, p. 40]. 
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NE W BOOKS 
AN DUDESTS 


Books comparison with the senior author’s The Psychology 
of Behavior Disorders (1947). The present volume 
is not a revision or replacement of the earlier one 
Instead, the two books are supplementary. Behavior 
Disorders, as its name implies, was organized a { 
the major syndromes. Behavior Pathology 
ganized in terms of the person and his behay 
from beginning to end it is truly a psycholo 

little trace of descriptive psychiatry remaini 
example, there are chapters on pseudocomm 





BENNETT, GeorGF K., SEASHORE, HAROLD G., AND 
WesMAN, ALEXANDER G. Counseling from pro- 
files, a casebook for the Differential Aptitude 
Tests. New York: Psychological Corp., 1951. Pp. 
95. $1.75. 


Counselors in nine school systems provided thirty 
typical case studies of eighth-grade to twelfth-grad 
pupils who had been counseled with the aid of the 
DAT. The attractive booklet presents these case 
studies, each with comments by the authors of the 
tests, as an aid to counselor training. While the 
case studies are not assumed to prove the validity 
of the tests, they certainly demonstrate the value of 
a differentiated profile, as opposed to a single schol- 
astic aptitude score. 


autistic community, on disorganization, and 
socialization as behavior phenomena, but no 

on schizophrenia as such. The two books diff 

a much more important respect, however. B 
Disorders tended to be one-sided. It pre 

1uthor’s biosocial theory in relative isolati 

other contemporary thinking. Behavior Pathology is 
a work of greater scope, that incorporates ré 
evidence on needs, learning processes, confli: 
BERNARD, Harotp W. Toward better personal ad- anxiety, and yet retains its essential unity. All 
justment. New York: McGraw-Hill, 1951. Pp. chologists will find this book stimulating to the 
vii + 439. $4.00 thinking, whether their interests lie in clinica 
lications or in more general psychologic 


] 


. . . . ’ 
An orientation textbook, intended to guide college . ' - 
The volume will also be a use 


students toward better mental health and personal sees 
. . . . entwion 
efficiency. Its psychological theory is antiquated. 


The topic of learning, for example, is treated almost be : Ze 
entirely in terms of William James’s precepts about yt eegneine their cour e art me 3 will 
habits, and “good” and “bad” habits are dicho that the adventure is rewarding to tl 
mized freely. As a practical tool for college guid- 
ance, however, the book is not without assets. The 
author admirably avoids the temptation to mystify "ARROLL, HERBERT A. Mental hygiene, the d 

his readers or to frichten them with morbid ‘ es of adjustment. (2nd Ed.) New York Pre 
Optimism and realism are blended appropriately, Hall, 1951. Pp. xii + 448. $5.65. 


as well as to themsely 


and the emphasis on self-responsibility for personal Che revision of Carroll’s Mental Hygiene 
improvement is constructive. t expanded from the first edition, a 
one-third of its material is new. The cha 
. r . motivation and learning re | 1 revised ext 
Cameron, Norman, AND Macaret, ANN. Behavior i ion and learning -_ een revised ex 


pathology. Boston: Houghton Mifflin, 1951. Pp Age tg. "agi mucept for : 
al 645 ec an sion of anxiety, the treatment of motivati 


ly enumerative. The discussion of learnit 

Let no one underestimate this book — it is ane ly improved and cites recent and pertinent 
of the most original and significant contemporary mental studies. The book remains a highly 
contributions to the understanding of human be- text for undergraduate instruction. Its short 
havior. With skill and creativity, the authors have with respect to organization and precision 
integrated the central recent insights about deviant set by the merits of its simple writing, its ol 
behavior: that it originates from learning. that the viewpoint, and its genuinely hygienic attitude 
learning takes place in social contexts and involves 
communication and role-taking, and that psychother DAVIDOFF, EUGENE, AND, NoetTzet, Evinor S$ 
apy is social relearning. The book inevitably invites child guidance approach to juvenile delin 

SS Seen New York: Child Care Publications, 19 

Note: Some reviews in this issue were prepared vi + 173. $4.50. 
by the Associate Editors, who may be identified by Pe? : : 
their initials. Unsigned reviews are by the Editor his is a loosely written, poorly organiz: has 
= ie F } lection of papers distantly related to the title of the 


$12 





k. Beginning with a most obvious discussion of 
he “organization of the child guidance clinic” and 
he functions of psychiatrists, psychologists, and 
ocial workers, the authors proceed to discuss the 
reatment of different “types” of delinquents. Much 
{this discussion consists of long lists of “types,” of 
do’s”’ and “don’t’s,” and of illustrative cases in 
shich the “dynamics” are intricately and uncon- 

cingly spun out. 





Several disconnected chapters 


} 
89 low on delinquents and drama techniques; on the 
— reatment of pathological liars (with a list of 36 
oe of liars) ; on “additional therapeutic consider- 
Ht J ns”; and on a “follow-up study of 374 children,” 
' ymong other chapters. The follow-up consisted es- 
entially of determining whether these children were 
e . r were not subsequently registered with the Social 
For Service Exchange, and little else. The case for the 
: hild guidance clinic in the treatment of juvenile 
1, ielinquents is not enhanced by this volume.—M. K. 
_ y, RicHarp, AND Humser, W. J. The develop- 
— ment of human behavior. New York: Macmillan, 
1951. Pp. xv + 762. $5.50. 

by Is A sociologist and a psychologist collaborated in 
a iring this interestingly integrated text, suitable 
and r introducing undergraduates to the social sciences. 
es The book’s five sections deal with foundations of 

nei 


xvchological and social theory, development from 
4 incy to old age, deviant roles and statuses, social 
titutions, and the 
degree of 
ugh mature 


problem of the good society. 
integration is a real 
readily 


achievement. 
label 
perceive the 
and will forever after be forti- 
against spurious compartmentalizations. Also, 
tyle is lively and readable, and the illustrative 
terial well chosen. It is unfortunate that the book 
some evidences of carelessness — a 
beled figure, a garbled reference — but these 
liable shortcomings do not entirely conceal its 
some ral excellence 


scholars can 


most 


the subject matter, students will 


k as “one course,” 


bed 


marred by 


LMER, B. von Haver. How to help your child 
velop successfully. New. York: Prentice-Hall, 
51. Pp. viii + 368. $3.95. 


experi- A book for parents, which gives questions and an- 
sirable MM "ers on typical parent-child problems, sketches de- 

gs Mim lopment from birth to ten years, and provides a 
ire off 2-page section for recording a child’s physical and 
jective ME chological development. The author takes a 
les uddle course between “training” and indulgence, 


1 writes with an open-minded good humor that 

5 The ae "ll help many parents and alarm few. Advocates 
‘the current trend toward full indulgence will not 

¢ the book; other psychologists who are no more 

2 accord with the excesses of Ribble than with 

my se of Watson will recommend it. The style of 
iting is breezy and appropriately nontechnical, 
ad the numerous illustrations are amusing. 
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Harrower, M. R., AND Steiner, M. E. Large scale 
Rorschach techniques. (2nd Ed.) Springfield, Lil: 
Charles C Thomas, 1951 $8.50 


for the Group 


Pp. xx + 353 


The second edition of the manual 


Rorschach and the Multiple Choice 
The main body of the 
ok is unchanged, save for the deletion of 


Test is not a 
revision in the usual sense. 
origina! 
nume:vus pages of superfluous graphs, and for a 
few minor editorial corrections. 
material is confined to a 53-page added section on 
“recent developments,” which does scant justice to 
its topic. Of the 34 “new” are 
dated 1947 or earlier. Moreover, the section’s poor 
organization gives evidence of hasty and careless 
preparation. A really up-to-date annotated bibli- 
ography would have been more serviceable. There 
is also a supposedly new section on “card-pull” that 
turns out to be a rearrangement of some old data 
on the relative frequencies of locations, determinants, 
and content categories. The second edition does not 
give nearly as good support as it might to the gen- 
uinely useful Group Rorschach. On the other hand, 
it maintains the unduly optimistic attitude of the 
first edition toward the Multiple Choice Test, in 
spite of the bulk of adverse evidence. 


Ihe principal new 


studies cited, 26 


HARTWELL, SAMUEL W. Citizens’ handbook of sex- 
ual abnormalities and the mental hygiene ap- 
proach to their prevention. Washington, D. C.: 
Public Affairs Press, 1951. Pp. viii + 71. $1.00. 


One of the reports of the Michigan Commission 
on the Deviated Criminal Sex Offender is this help- 
ful booklet for laymen, which takes up the types 
of sex deviations, common fallacies, and the pre- 
ventive mental hygiene of sex. The prevention of 
sex deviations is discussed mainly in terms of the 
psychoanalytic interpretation of parent-child rela- 
tionships. 


Janopa, MArie, DeutscuH, Morton, AND Cook, STu- 
ART W. Research methods in social relations. Vol. 
1: Basic processes. Vol. 2: Selected techniques 
New York: Dryden Press, 1951. 422, 
x + 337. $3.75 each vol., 


Pp. x + 
$6.00 set. 


This able reference work and textbook on social 
research methods was developed by a committee of 
the Society for the Psychological Study of Social 
Issues. The first volume, written mainly by Jahoda, 
Deutsch, and Cook, takes up the major steps of sci- 
entific inquiry: the formulation of a problem, re- 
search design, measurement, data collection, analysis 
and interpretation, and presentation and application. 
Volume two consists of eleven chapters by fifteen 
principal contributors that deal with some of the 
more specific technical problems of social research, 
such as constructing questionnaires, interviewing, 
observing, sample design, and scaling. Although 
the discussion of designs and techniques is focused 
mainly on the problem of prejudice, the material 
has broader applicability. Because the research 
problems and tools of social psychology are so close 
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to those of clinical psychology, the present work is 
probably the best available approximation to a hand 
book for clinical research. It will become a widely 
used resource for graduate students struggling with 
dissertations, for hard-pressed professors, and for 
clinical research workers generally. 


Jones, ARTHUR J. Principles of guidance and pupil 
personnel work, (4th Ed.) New York: McGraw- 
Hill, 1951. Pp. xvi + 630. $4.75. 


The author states that this edition “is not essential- 
ly different from the third in general organization 
and in fundamental concepts.” He views guidance 
as an aspect of personnel work, but argues for the 


continued use of the concept. Proportionally great 
space is devoted to the need, purpose, administration, 
and method of investigation in guidance. Special 
chapters describe guidance in elementary schools, in 
junior high schools, in secondary schools, in colleges, 
with out-of-school youth, and with Negro youth. Al- 
though Jones mentions recent literature in the field 
his whole frame of reference seems to the reviewer 
to reflect the emphasis at the time of his first edition 
rather than that of today. One might expect that, 
in 1951, more space would be devoted to the dynam- 
ics of human behavior which is to be guided rather 
than to the importance and machinery of guidance 


as such.—F. McK. 


Marx, MEetvin H. (Ed.) Psychological theory: con 
temporary readings. New York: Macmillan, 1951. 
Pp. xi + 585. $5.00. 


If the maturity of a science can be judged by the 
systematic and critical attention that it pays to the 
construction of its theories, then psychology has in- 
deed matured greatly in the past two decades. In 
what is perhaps the first book of its kind, Marx has 
effectively documented the contemporary growth of 
psychological theory. The well-selected papers are 
divided into two groups: the first twenty-three deal 
directly with problems of psychological theory con- 
struction; the remaining twenty-four illustrate the 
process from the writings of important recent psy- 
chological theorists. The total impression made by 
the papers is very different from that conveyed by 
the typical survey of “schools of psychology” that 
was so popular twenty or more years ago. There are 
no warring antagonisms. Instead, the contemporary 
viewpoints when placed side by side seem highly 
complementary to one another. One does not ask 
whether molar or molecular concepts, or behavior- 
istic or field-theoretical psychologies, are “right” or 
“wrong,” but sees instead their distinctive contribu- 
tions toward a common understanding. The collec- 
tion will be an indispensable reference for advanced 
students. 


MICHIGAN, STATE OF. Report of the Governor's 

Study Commission on the Deviated Criminal Sex 
Offender. Detroit, Mich.: Governor’s Study Com- 
mission, 1951. Pp. xii + 245. 


BOOKS AND 





rESTS 


A two-year study by a commission appointed 
the governor of Michigan led to a number of broad 
and constructive recommendations of procedurs 
dealing with sex deviates. Notable proposals ar 
for continued and extended research, for the ; 
vision of differentiated treatment facilities, for jp- 
determinate for treatable offenders, for 
y consideration of the mental health of the 
victims of sex crimes, for the education of police and 


sentences 
wecoieer 

id 
other public officers, and for preventive measures 
through parent-child education. The recommenda 
tions are supported by statistical data, by cas 
studies, and by reports of the deliberations of 
committees. 


Mupp, Emiry HARTsSHORNE. The practice of mar 
riage counseling. New York: Association Press 
1951. Pp. xx + 336. $4.50. 


A survey of the development and status of mar- 
riage counseling, with special reference to the work 
of the Philadelphia Marriage Council. The practical 
chapters on the functions and processes of counsel 
ing, with numerous case studies, provide solid train 
ing materials for persons preparing to be marriage 
counselors and for general counselors who encounter 
marital problems. There is an especially good dis- 
cussion of the relationships of the various profession- 
al groups that deal with aspects of marriage. An 
appendix gives condensed reports of the resources 
and practices of thirty-five marriage counseling 
agencies. 


Powers, Epwin, AND WitMeR, HELEN. An experi- 
ment in the prevention of delinquency. New York 
Columbia Univ. Press, 1951. Pp. xliii 649 
$6.00. 


This volume is a report of the Cambridge-Somer 
ville Youth Study, an action research designed t 
test the hypothesis that juvenile delinquency can be 
prevented by establishing a sustained, friendly re 
lationship between a boy and a counselor who can 
give him a satisfactory ego ideal. Although the 
original experimental design was a carefully con- 
trived one, changing personnel and the effects of 
the war years disrupted the standards that had beer 
set up. However, it was felt wise to report the re 
sults, even if incomplete. The study used two groups 
of boys. Each experimental boy had all the benefits 
of the counselor and of the study’s resources, while 
his control “twin” was influenced only by the or 
dinary events and situations that he encountered i 
his culture. In the report, Powers describes the de- 


sign and the findings and Witmer, in the role of § 


a critical outside observer, evaluates the successes 
and failures of the study. Gordon W. Allport con- 


tributes a challenging foreword. Not least among) 
the authors are the boys themselves, who give their 7 


own reactions to the experiment. This refreshingly 
frank and objective report is a “must” for all who 
are concerned with juvenile delinquency.—B. M. / 

















RECK 
Ewal 


7th 


This 


ns of 


1 ex 


iilat 
nenta’ 
ratme 








peri- 
Y ork 
649 


ymer 
ed to 
an be 
9 can 
h the 

con- 
cts of 
| been 
he re 
rroups 
enefits 
while 
he or- 
red in 
he de- 


ole of 


ecesses 
rt con- 





sponte 


among | 
e their | 


hingly 
Il who 
M.!1 


NEW BOOKS AND TESTS 


<HWEBEL, Mitton. The interests of pharmacists. 
New York: King’s Crown Press, 1951. Pp. xii + 
84. $1.75. 


In addition to developing a pharmacist scale for 
se Strong Vocational Interest Blank, this study ex- 
red the relationship of job satisfaction to interest 
Its 
jicable to the measurement 
action in other occupations. 


1rement. techniques and findings are ap- 


of interest and satis- 


“ceL, Dav. Frustration in adolescent youth. Fed- 
eral Security Agency, Office of Education, Bulletin 
1951, No. 1. Washington: U. S. Government Print- 
Office, 1951. Pp. vi + 65. 25¢. 


ine 


This pamphlet about adolescents and their prob- 
ms is addressed primarily to educators and school 
Its theoretical about 
from the phenomenological of 
yez and Combs and from Maier’s analysis of re- 
tion to frustration as behavior without a goal. 
xcept for a probably unjustified faith in question- 
sires as instruments for identifying frustrated per- 

the practical recommendations for prevention 
nd rehabilitation seem to be well conceived. 


inselors. orientation draws 


qually approach 


TRECKER, EpwArD A., EBAUGH, FRANKLIN G., AND 

EwALT, JAcK R. Practical clinical psychiatry. 

7th Ed.) Philadelphia: Blakiston, 1951. Pp. xii 
506. $7.00. 


This is the seventh, revised edition of what has 
vy become a leading textbook in the psychiatric 
‘ld. The authors’ names will attest to the sound- 
ss of the treatment, and the revision is a genuine 
1 extensive one. be recommended to 
nts and to practicing clinicians who may want 
wailable a reference book with a more medical 
entation than is found in the usual psychological 


ratment of the mental disorders.—W. A. H. 


It can stu- 


TESTS 


{LLPoRT, Gorpon W., VERNON, Puiip E., AND LInpD- 
zeEY, GARDNER. Study of Values, Revised Edition. 
Test booklets ($2.50 per 25) with manual, pp. 16. 
Boston: Houghton Mifflin, 1951 


After twenty years of wide and varied use, the 
udy of Values has been revised thoroughly. There 
ie changes both in content and in format. The 
1 “social” value, which had proved unreliable be- 
tuse of adherence to Spranger’s excessively broad 
‘inition, has been redefined in terms of altruistic 
' philanthropic interest. Most of the items have 
¥en revised, to simplify wordings, and to eliminate 
tdated and overspecific cultural references. The 
ew items give the impression that they are likely 
maintain their meanings with the passage of 
‘me. The manual gives an extensive report of the 
‘atistical characteristics of the revision. New and 
wised items were selected by three successive item 
ilyses, and each scale is internally consistent. 


st 
_ 
Si 


Scale reliabilities are substantially improved, rang- 
ing from .87 to .92 on a one-month retest. Scale ia- 
tercorrelations are mainly Norms are supplied 
in terms of the means and SD’s of 1,81¢ 


low 


college stu- 


dents, and are further analyzed in terms of sex and 

of occupational groups. The new format simplifies 

the process of scoring and eliminates tl need for 

a separate score sheet. In its excellent new revision, 

the Study of Values seems destined to several more 

decades of service as a tool for guidance f re 
search. 

Moore, Bruce V., Lapp, C. J., AND Grirrin, CHARLES 
H. Engineering and Physical Scien 1 ptitude 
Test, Revised Manual. New York: Psvchological 
Corp., 1951. Pp. 11 
The Engineering and Physical Science Aptitude 


Test, a 72-minute composite of parts on mathematics, 
formulation, physical science comprehension, arith- 
metic reasoning, verbal me- 
chanical comprehension, developed during 
World War II as a placement battery for accelerated 


comprehension, and 


was 


engineering programs. The revised manual contains 
validation data, and 
norms for college and employee groups. 


intercorrelations, percentile 


Roeper, Westey S., AND GRAHAM, Hersert B. Af- 
titude Tests for Occupations 9-adult. 6 
tests, 1 form of each: Personal-Social, Mechanical, 
General Sales, Clerical Routine, Computational, 
Scientific. 12 (20 )to 20 (23) min. each test, 107 
(125) min. total. Test booklets (10¢ ea., $1.25 to 
$2.00 per 25, according to test), manual, 
key; profile sheet (5¢) ; specimen set, 1 test (35¢) 
Los Angeles: California Test 1951. 


Grade 


52 


with 


Bureau, 


The Aptitude Tests for Occupations series con- 
sists of six test booklets, one for each of the areas 


covered. The Personal-Social and General Sales 
tests use described situations with multiple-choice 
questions. The Mechanical test employs pulley 


systems, pattern layouts, and tool recognition. Cleri- 
cal Routine includes name and number checking and 
spelling. The test arithmetic 
al calculations and estimates, and the Scientific test 


Computational uses 
consists mainly of reasoning and comprehension with 
scientific materials. Full data given on 
abilities, which mainly from .78 to .91. The 
test intercorrelations are moderate, mostly from .40 
to .70 for males, and lower for females. The cor 
relations of the tests with other similar 
stated in considerable detail and are, on the whole, 
satisfactory. Percentile for each test 
tabulated for each grade from 9 to 13, and separate- 
ly for males and females. In the main, these tests 
fulfill the current expectations of psychologists for 
a series to measure differentiated abilities. They 
fall decidedly short of ultimate ideals for test de- 
velopment in two respects. First, there is evi- 
dence that they measure “aptitude” as distinguished 
from achievement, and the magical word is likely 
to deceive the unwary. 


are reli- 


run 


tests are 


norms are 


no 


Second, although the tests 
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are recommended mainly for occupational guid- 
ance, they are not accompanied by evidence to show 
their power to predict successful adjustment to any 
vocation. The latter fault can be remedied by fur- 
ther research. 


SULLIVAN, ELIZABETH T., CLARK, WiLuis W., AND 
liecs, Ernest W. California Short-Form Test 
of Mental Maturity, 1950 S-Form. 5 levels: grades 
Kgn-1, 1-3, 4-8, 7-10, 9-adult. 1 
level. 20 (30) to 52 (70) 
Test booklets ($1.75 


record Scoreze 


form at each 


min., according to level 
with manual, key, 
sheet (7¢); IBM 
(4¢) ; specimen set, 1 


Los Angeles: California Test Bureau, 1950. 


per 25) 


sheet ° 


answer 


answer sheet level (35¢ 


The 
places 
measured, and less on the total MA 
test for 
parts, 


this mental ability test 


stress on the 


revision of group 


more differential aptitudes 
and 1Q. The 
seven 


stage of consists ot 


presumably 


each maturity 


comparable from level to level 


because of similar content. Scores are obtained for 
total mental factors, language factors, nonlanguage 
factors, spatial relationships, logical reasoning, nu- 
merical reasoning, and verbal concepts. Total score 
reliabilities from .92 to .95, and 
from .81 to 


at the upper age 


range part-score 


reliabilities .95, being generally higher 


The subtest intercorrela 


levels. 1 
tions, mainly from the .20’s to the .40’s, give some 


support to the differential interpretation of the pro- 





AND TESTS 


files. Norms are based on very large grou at 
were controlled with respect to age and school 
progress. No evidence is given, however, about the 


geographic or socioeconomic distribution of the 
normative subjects. Validity is defended ji ‘ 
of high correlation with the Stanford-Binet t 
the exact coefficient is not stated. No d 

presented on the test’s relationships to subsequent 


school achievement or to other appropriate criteria. 
In these respects, the information given in the 
manuals falls short of meeting desirable profession- 
al standards. 


We tts, F. L. Modified Alpha Examination, Form 
9, Revised Manual. New York: Psych cal 
Corp., 1951. Pp. 8. 


The revised manual for this ancient but 


casionally useful warhorse includes revised norms 
for high school pupils and for 1,331 adult mal¢ 


engineering employees. 


ALEXANDER G. 
Test, Re 


chological Corp., 1951. 


WESMAN, 


fication vised Manual. 


Pp. 15 


The PCT is a test for business and industrial use 


that yields separate verbal and numerical 
res. The 1951 edition of the manual reports a 
imber of studies of the test’s applications, j 


gives additional norms. 


Wesman Personnel Classi- 


New York: Psy- 
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INDEX TO VOLUME 15 
SUBJECT INDEX 


iCE, and personality inventory data, 160 

\djustment and values, index of, 257 

\dlerian therapy, 32 

inxiety, problem of, in client-centered therapy, 359 
ttitudes, toward self and others, 79 


whavior, psychologist’s, as area of research, 278 
wok reviews, 85, 167, 263, 354, 445, 512 


‘ss¢ reporting, psychological, at psychiatric staff 
conferences, 425 

erebral palsied children, handedness in, 9 
hildren, normal, situational play therapy with, 225 
‘ient-centered therapy, 72 

problem of anxiety in, 359 

‘ient responses, and counseling method, 216 

‘inical interns, procedures and methods in supervis- 
ng, 267 

‘inical psychologist, and mental defectives, 1 
(inical psychology, oscillation and progress in, 281 
trainees, supervision of in neuropsychiatric hospi- 

tal, 271 

where are we going, 171 

blor, effects of, on thematic fantasy, 412 
inferences, psychiatric staff, psychological case re- 
porting at, 425 

‘unseling method, and client responses, 216 
unselor, general, eclecticism of, 76 


hkterioration, in senile psychosis, 145 
linkers, psychometric study of, 501 
hive theory, and TAT responses, 45 


tnstellung-effect, as test of rigidity, 89 
stellung test, of rigidity, relation to concreteness 
f thinking, 303 

tectroconvulsive therapy, effect of, on psychological 
est performances, 430 

kperimental stress, Rorschach factors related to, 190 


htking, experimentally induced in TAT, 469 
frustration, induced, 18 
nduced in thematic apperception stories, 400 


foup conformity rating, Rosenzweig Picture-Frus- 
tration Study, 250 

widance center, community, supervision of interns 
in, 268 


landedness, in cerebral palsied children, 9 
larrower-Erickson multiple choice test, patients 
with spinal cord injuries, performance on, 346 
iewson ratios, 163 

tumm-W adsworth, and Minnesota Multiphasic, 457 


sight, two inventories of, 109 
wtelligence, epileptic, 392 


Intellectual classification, semantics of, 487 
Intelligence level, Rorschach “organic signs” and, 343 
Interns, in community guidance center, supervision 
of, 268 
Interview devices, projective pictures as, 405 
LP.A.T. Culture-Free Intelligence Scale 2 
IQ standardization of, 154 
1Q, .variability, in relation to age, 231 
on revised Stanford-Binet, 231 


Language, schizophrenic, regression of, 387 


Masculinity-femininity measures, comparison of, 484 
Mental defectives, and clinical psychologist, 1 
Mental hospital, patients, symptoms patterns in 

group of, 290 
Military personnel, nervous symptoms among, 320 
Minnesota Multiphasic Personality Inventory, 

and Humm-Wadsworth, 457 

factor analysis of fifteen scales of, 451 

internal structure of, 134 

item analysis aid for, 262 

norms, reliability and forms, 435 

profile coding, 82 

profiles in multiple sclerosis, 253 

profiles in patients with low-back pain, 350 

role-playing ability test derived from, 102 

Sc scale, 142 
Multiple sclerosis, MMPI profiles in, 253 


Nervous symptoms, among military personnel, rela- 
tion to age, combat experience, and marital status, 
320 

Neuropsychiatric hospital, clinical psychology trainees 
in, supervision of, 271 

Nondirective therapy, 32 

Normals, young, and schizophrenics, young, perform- 
ance on Wisconsin Card Sorting Test, 311 


Personality, self-evaluative questionnaires as projec- 
tive measures of, 496 
Projective measures of personality, self-evaluative 
questionnaires, 496 
Projective pictures, as interview devices, 405 
Projective technique, Michigan Picture Test, 51 
see also Rorschach Szondi; TAT 
Psychiatric symptoms, practical correlates of, 505 
Psychiatric syndromes, generality of, 372 
Psychoanalysis, transference in, 72 
Psychoanalytic therapy, 32 
Psychotherapy, accuracy of self-evaluations in, 95 
group, evaluation of by means of performance 
tests, 363 
hearing versus reading interviews, 115 
and insight, 109 
and role-playing ability, 102 
training in, 274 
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Psychotics, organic, symbol arrangement test, 439 
Punishment-reward expectancy in thematic apper- 
ception stories, 400 


Research, psychologist’s behavior as an area of, 278 
Research scientists, psychological tests of, 492 
Rigidity, Einstellung-effect as test of, 89 

Einstellung test of, 

thinking, 303 
Role-playing ability, test of, 102 
Rorschach and Behn-Rorschach, inkblot tests, com- 

parison of, 185 

cards, color as a determinant of responsiveness to, 

in schizophrenia, 196 

color and 8-9-10 per cent, 122 

descriptive diagnosis, 460 

factors related to experimental stress, 190 

forerunner of, 120 

impulsivity as indicated by, 464 

organic signs and intelligence level, 343 

norms, a contribution to, 178 

norms for “shock,” 127 

predicting continuation in therapy, 368 

test results, influence of set for area on, 337 
Rosenzweig Picture-Frustration 


relation to concreteness of 


study, conformity 


rating of, internal consistency of, 250 


Schizophrenic language, regression of, 387 
Schizophrenics, young, and normals, young, perform 
ance on Wisconsin Card Sorting Test, 311 
Scientists, research, psychological tests of, 492 
Self-evaluations, accuracy of, 95 
Self-evaluation questionnaires as projective measures 
of personality, 496 
Senile psychoses, relation of writing speed to, 243 
Situational play therapy, with normal children, 225 
Stanford-Binet, and Wechsler Intelligence Scale for 
Children, 236, 330 
and Wechsler Intelligence Scale for 


functioning on, 475 


Children, 
pupil 
IQ variability on, 231 

Symbol arrangement, original test of, validated on 
organic psychotics, 439 

Symptom patterns, in group of mental hospital pa- 
tients, 290 

Syphilitics, congenital, performance of, on Wechsler 
Intelligence Scale for Children, 239 

Szondi test, instability score, development of, 334 
four experiments with, 39 
note on Fosbere’s four experiments with, 511 

pictures, factors affecting the choice of, 210 

pictures, stimulus values of, 419 

valence distribution, 130 


Test, ACE, 160 

Einstellung-effect and rigidity, 89 

Einstellung, of rigidity, relation to concreteness of 
thinking, 303 
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Harrower-Erickson multiple choice, perfor e 
patients \ ith spinal cord injurie 

Humm-Wadsworth, 457 

I.P.A.T. Culture-Free Intelligence Scale, 2, 154 

Michigan Picture, 51 

MMPI, 82, 102, 134, 142, 350, 435, 457 

performance and evaluation of group psychother- 

apy, 363 

performance, psychological, effect of brief stimulus 

‘onvulsive therapy on, 430 

Rorschach, 120, 122, 127, 178, 185, 190, 196, 337, 

3 460, 464 

Szondi, 39, 130, 210, 334, 419, 505 

Stanford-Binet, 2321, 236, 330, 469 

symbol arrangement, 439 

Thematic Apperception, 45, 55, 203, 469 

Wechsler-Bellevue 68, 151, 163, 325 

Wechsler Intelligence Scale for Children, 62, 23¢ 


eCieci rin 


343. 368 


239, 330, 381, 385, 475 
Wisconsin Card Sorting, 311 
Thematic Appercepiion Test, and drive theory, 45 
card, ambiguity values of, 203 
experimentally induced faking of responses, 469 
two methods of administration, 55 
Thematic fantasy, some effects of color on, 412 
Thematic apperception stories, induced frustration 
and punishment-reward expectancy in, 400 
Therapist, reaction of patient to, 
und research device, 24 
Therapy, Adlerian 
brief stimulus electroconvulsive, effect of on ps 


used as training 


chological! test perfomance, 430 
client-centered, 72 
client-centered, problem of anxiety in, 359 
nondirective, 32 
prediction continuation in, by 
Rorschach test, 368 
psychoanalytic, 32, 72 
reaction of patient to therapist in, 24 
situational play, with normal children, 225 
Transference, in client-centered and psychoanalytic 
therapy, 72 


Voice, and clinical judgments, 115 


Wechsler-Bellevue, and descriptive diagnosis 
neurotics and schizophrenics, 151 
schizophrenia, 68 
validity of Hewson ratios, 163 

Wechsler Intelligence Scale for Children and Stan- 
ford-Binet, 236, 330, 475 
equivalent test and mental ages for, 381 
evidence on validity of, 385 
performance of congenital syphilitics on, 239 
Verbal and Performance IQ’s, 62 
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